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Abstract 

The 201 1 Les Houches workshop was the first to confront LHC data. In the two 
years since the previous workshop there have been significant advances in both 
soft and hard QCD, particularly in the areas of multi-leg NLO calculations, the 
inclusion of those NLO calculations into parton shower Monte Carlos, and 
the tuning of the non-perturbative parameters of those Monte Carlos. These 
proceedings describe the theoretical advances that have taken place, the impact 
of the early LHC data, and the areas for future development. 

Report of the SM and NLO Multileg and SM MC Working Groups for the Workshop "Physics at TeV 
Colliders", Les Houches, France, 31 May— 8 June, 20 IL 
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Parti 

INTRODUCTION 

The workshop in 2011 was the first for which the long-awaited LHC data (at 7 TeV) was available for 
analysis and comparison to theory. Even though of limited statistical power compared to the ultimate 
goals of the LHC, this data accesses a very wide kinematic range, and probes regions where multiple 
scales are important. The presence of large scales for some processes, on the TeV level, points to the 
importance of electroweak corrections, which have been calculated only for some of the important pro- 
cesses. The first hints of a Higgs boson have now been observed. In order to search for signs of New 
Physics, as well as to completely understand the Standard Model at the LHC, it is important to understand 
the perturbative framework at the LHC. The data taken so far provides many challenges for perturbative 
QCD predictions; and it is clear that New Physics, if it is present in current data, is hiding well. 

On the theoretical side, there has been a great deal of productivity in the area of multi-particle 
calculations at next-to-leading order (NLO) and next-to-next-to-leading order (NNLO). NLO is the first 
order at which the normalization, and in some cases the shape, of perturbative cross sections can be 
considered reliable [IJ. A full understanding for both Standard Model and beyond the Standard Model 
physics at the LHC requires the development of fast, reliable programs for the calculation of multi-parton 
final states at NLO. There have been many advances in the development of NLO techniques, especially 
in the area of automation 1^ [1 [1 [3 [a [3 il [1 Ha [m m • 

Some of these approaches also allow for relatively easy llT3l [T4l [T2l and/or automatic 1,15,1 inclu- 
sion of the NLO matrix elements into parton shower Monte Carlo programs. For more details we refer 
to the individual contributions in these proceedings. 

A prioritized list of NLO (and some NNLO) cross sections was assembled at Les Houches in 
2005 [16 1 and added to in 2007 fTT'l and 2009 [18|. This list includes cross sections which are ex- 
perimentally important, and which are theoretically feasible (if difficult) to calculate. As we stand 
now, basically all NLO 2 — )• 3 and 2 — )• 4 cross sections of interest have been calculated, see Ta- 
bles T]2 below, and even some processes which were not on the 2009 wishlist are available at NLO, see 



e.g. il9ii20ll2n[22l[23ll24ll25ll . The success of automation techniques means that future NLO calcula- 
tions of similar complexity can be completed without the man-years of labor previously required. Thus, 
we do not add to the NLO wish list in 201 1. Instead, we comment on calculations needed at NNLO, and 
processes at NLO for which it is important to calculate the impact of electroweak corrections, and/or the 



influence of interference effects with other processes with the same final state. 

For many of the processes calculated at the LHC (such as for Higgs production), it is important 
either to apply a veto for the production of extra jets, or to bin the analysis results according to the jet 
multiplicity. While such cuts are useful for dealing with the experimental backgrounds, the exclusivity 
of the cross sections results in increases to the theoretical uncertainties obtained for the corresponding 
inclusive results, see e.g. ||26]| . The impact of such cuts is explored in the contribution of Stewart and 
Tackmann in these proceedings. 

Much of the complexity for multi-parton NLO processes consists of the calculation of the non- 
leading color contributions. Such contributions typically contribute only at the level of a few percent and 
approximations to the non-leading color contributions should be accurate within a percent or so ITOl ITTl 
\5T\. So it may be more time-prudent for groups carrying out such calculations to estimate the non-leading 
color effects before carrying out the full calculation. 

To reach full utility, the codes for any of these complex NLO calculations should be made public 
and/or the authors should generate ROOT ntuples providing the parton level event information from 
which experimentalists can assemble any cross sections of interest. Where possible, decays (with spin 
correlations) should be included. A ROOT output option is especially useful where the creation of a 
user-friendly NLO program may be very time-consuming. We now have some experience with the use 
of ROOT ntuples with both MCFM and Blackhat-i-Sherpa calculations. The latter, in particular, does 
not exist as a public program, while ROOT tuples have been made available for NLO W/Z -i- n jet 
multiplicities (with n up to 4) for W/Z -i- jets, and (also for n up to 4) for inclusive jet production. The 
estimation of the correct scale for use in multi-parton NLO calculations, and the proper evaluation of the 
uncertainty on this scale, is more complex than for simpler calculations. The use of ROOT ntuples can 
make these evaluations easier to carry out. A contribution describing their use has been included in these 
proceedings. 

While NLO is sufficient for most predictions, it is also crucial to understand certain critical cross 
sections at NNLO. To date, NNLO calculations have been carried out primarily for processes in e+e^ 
annihilation 1172117311741 . and in hadronic collisions for 2 — ;■ 1 processes, with the exception of VH IJTSl 
|76l|77l and 77 production IJTSl . 

To calculate a 2 — ;• 2 scattering process at NNLO, the divergent contributions arising from the tree- 
level 2 — ;• 4, the one-loop 2 — ;■ 3 and the two-loop 2 — ;• 2 subprocesses have to be properly subtracted 
and cancelled, such that the finite remainders can be combined into a parton-level event generator. To 
combine the three contributions, an infra-red subtraction scheme for unresolved real radiation is required. 
Several approaches have been used and are being further developed: antenna subtraction [79], which 
currently is extended to hadronic and semi-hadronic initial states [80l|8Tl|82l|83l|84l, a method based on 
sector decomposition appplied to real radiation |85 . 86 . 87 1 where the decomposition is guided by the 
physical singularity structure Il88ll89i . gr-subtraction ll90ll . which is very elegant but appplicable only to 
colourless final states, and the one of W\\ described in these proceedings. 

Further, two-loop amplitudes are interesting in their own right from a field theory point of view, for 
example to study asymptotic behaviour, or to gain insights into the all-order infared structure of massless 
field theories. 

Below we construct a table of calculations needed at the LHC, and which are feasible within the 
next few years. Certainly, results for inclusive cross sections at NNLO will be easier to achieve than 
differential distributions, but most groups are working towards a partonic Monte Carlo program capable 
of producing fully differential distributions for measured observables. 

• it production: 

needed for accurate background estimates, top mass measurement, top quark asymmetry (which is 
zero at tree level, so NLO is the leading non- vanishing order for this observable, and a discrepancy 
of theory predictions with Tevatron data needs to be understood). Several groups are already well 



Process {V £ {Z, W, 7}) 


Comments 


Calculations completed since Les Houches 2005 




I. pp ^ yyjet 


WW )et completed bv Dittmaier/Kallweit/Uwer Il27ll28l: 




Campbell/Ellis/Zanderighi ||29l. 




ZZ jet completed by 




Binoth/Gleisberg/Karg/Kauer/Sanguinetti ||30| 




WZ ]et.W^ )et completed bv Campanario et al. 03111321 


2. pp ^ Higgs+2jets 


NLO QCD to the gg channel 




completed by Campbell/Ellis/Zanderighi ||33|; 




NLO QCD+EW to the VBF channel 




completed bv Ciccolini/Denner/Dittmaier 11341 1351 




Interference QCD-EW in VBF channel 1361133 


3.pp^VVV 


ZZZ completed by Lazopoulos/Melnikov/Petriello |38| 




and WWZ by Hankele/Zeppenfeld [39], 




see also Binoth/Ossola/Papadopoulos/Pittau |40| 




VBFNLO Ell 113 meanwhile also contains 




WWW, ZZW, ZZZ, WW J, ZZ-f, WZ-f, W-f-f, Z77, 




777, ^77J E2@l|45l|4i|471|2ll 


4. pp — ;> tibb 


relevant for tiH, computed by 




Bredenstein/Denner/Dittmaier/Pozzorini ||48l |49ll 




and Bevilacqua/Czakon/Papadopoulos/PittauAVorek |50| 


5. pp^ y +3 jets 


W+3 jets calculated bv the Blackhat/Sherpa fiSTl 




and Rocket ||52| collaborations 




Z+3jets by Blackhat/Sherpa [53] 


Calculations remaining from Les Houches 2005 




6. pp — > tt+2jets 


relevant for tiH, computed by 




Bevilacqua/Czakon/Papadopoulos/Worek 11541 1551 


l.pp^VVbb, 


Pozzorini et al.[25|,Bevilacqua et al.|23| 


8. pp -^ yy+2jets 


W+W++2iets [56], W+W'+ljets 15711581, 




VBF contributions calculated by 




(Bozzi/) Jaser/Oleari/Zeppenfeld l59ll60l l6ll 


NLO calculations added to list in 2007 




9. pp ^ bbbb 


Binoth et al. miia 


NLO calculations added to list in 2009 




10. pp -^ y + 4 jets 


top pair production, various new physics signatures 




Blackhat/Sherpa: Vl^+4jets (221, Z+4jets |201 




see also HEJ [64^ for P1/^ + njets 


11. pp ^ Wbbj 


top, new physics signatures, Reina/Schutzmeier jfTTI 


12. pp -^ titi 


various new physics signatures 


also completed: 




pp^Wjj jet 


Campanario/Englert/Rauch/Zeppenfeld ll2 1 ll 


pp — ;> 4jets 


Blackhat/Sherpa [191 



Table 1: The updated experimenter's wishUst for LHC processes 



Calculations beyond NLO added in 2007 




13. gg -^ W*W* 0{a^a^,) 

14. NNLO pp -^ ti 

15. NNLO to VBF and Z/7+jet 


backgrounds to Higgs 

normalization of a benchmark process 

Higgs couplings and SM benchmark 


Calculations including electroweak effects 




16. NNLO QCD+NLO EW for W/Z 
NLO EW to W/Z 
NLO EW to W/Z+jet 
NLO EW to WH/ZH 


precision calculation of a SM benchmark 

|65l|66l 

I671I68I 

|69| 



Table 2: The updated experimenter's wishlist for LHC processes continued 



on the way to complete NNLO results for tt production 
W^W~ production: 

importand background to Higgs search. At the LHC, gg — )• WW is the dominant subprocess, but 
gg — )■ WW is a loop-induced process, such that two loops need to be calculated to get a reliable 
estimate of the cross section. Advances towards the full two-loop result are reported in |[96ll97l . 
inclusive jet/dijet production: 

NNLO parton distribution function (PDF) fits are starting to become the norm for predictions and 
comparisons at the LHC. Paramount in these global fits is the use of inclusive jet production to 
tie down the behavior of the gluon distribution, especially at high x. However, while the other 
essential processes used in the global fitting are known to NNLO, the inclusive jet production 
cross section is only known at NLO. Thus, it is crucial for precision predictions for the LHC for 
the NNLO corrections for this process to be calculated, and to be available for inclusion in the 
global PDF fits. First results for the real- virtual and double real corrections to gluon scattering can 
be found in imm. 
V-i-1 jet production: 

W/Z/ J + jet production form the signal channels (and backgrounds) for many key physics pro- 
cesses, for both SM and BSM. In addition, they also serve as calibration tools for the jet energy 
scale and for the crucial understanding of the missing transverse energy resolution. The two-loop 
amplitudes for this process are known HlOOi llOll . therefore it can be calculated once the parts 
involving unresolved real radiation are available. 
V-H7 production: 

important signal/background processes for Higgs and New Physics searches. The two-loop helicity 
amplitudes for qq — ;> W^j and qq -^ Z^j recently have become available II102L 
Higgs-i-1 jet production: 

As mentioned previously, events in many of the experimental Higgs analyses are separated by the 
number of additional jets accompanying the Higgs boson. In many searches, the Higgs -1- jet and 
Higgs -I- 1 jet bins contribute approximately equally to the sensitivity. It is thus necessary to have 
the same theoretical accuracy for the Higgs -1- 1 jet cross section as already exists for the inclusive 
Higgs cross section, i.e. NNLO. The two-Loop QCD Corrections to the Helicity Amplitudes for 
H 



3 partons are already available II103II . 

The contributions in this document are arranged as follows. In section |llj various developments 
concerning techniques for NLO and NNLO calculations are described, in particular in view of providing 



automated tools for NLO corrections. In section III issues related to parton distribution functions are 



discussed. Section IV contains phenomenological studies of observables and uncertainties, based on 
theory input where higher order corrections obtained by different approaches are available. Section [V] 
includes phenomenological studies on the definition of experimental observables and corrections applied 



to data. Finally Section VI discusses issues related to the tuning of Monte Carlos and standardised Monte 
Carlo output formats. 



Part II 

NLO AUTOMATION AND (N)NLO 
TECHNIQUES 

1. PJFRY - A C++ PACKAGE FOR TENSOR REDUCTION OF ONE-LOOP FEYNMAN IN- 
TEGRALS Q 

Abstract 

The C++ package PJFry 1 .0.0 II 1041 11051 - a one loop tensor integral hbrary - 
is introduced. We use an algebraic approach to tensor reduction. As a result, 
the tensor integrals are presented in terms of scalar one- to four-point func- 
tions, which have to be provided by an external library, e.g. QCDLoop/FF 
or OneLOop or LoopTools/FF The reduction is implemented until five-point 
functions of rank five. A numerical example is shown, including a special 
treatment for small or vanishing inverse four-point Gram determinants. Future 
modules of PJFry might cover the treatment of n-point functions with n > 6; 
the corresponding formulae are worked out. Further, an extremely efficient 
approach to tensor reduction relies on evaluations of complete contractions of 
the tensor integrals with external momenta. For this, we worked out an algo- 
rithm for the analytical evaluation of sums over products of signed minors with 
scalar products of chords, i.e. differences of external momenta. As a result, 
the usual multiple sums over tensor coefficients are replaced for the numerical 
evaluation by compact combinations of the basic scalar functions. 

1.1 PJFry 

The goal of the C++ package PJFry is a stable and fast open-source implementation of one-loop tensor 
reduction of Feynman integrals 

7'm-/^H = n(A [ AA nr=i k^" /,x 

suitable for any physically relevant kinematics jj The algorithm was invented in f\05\. PJFry performs 
the reduction of 5-point 1-loop tensor integrals up to rank 5. The 4- and 3-point tensor integrals are 
obtained as a by-product. Main features are: 

• Any combination of internal or external masses 

• Automatic selection of optimal formula for each coefficient 

• Leading Os are eliminated in the reduction 



'Contributed by: J. Fleischer, T. Riemann, V. Yundin 

^An extended description of notations and of the formalism may be found in 1 1051! 1061 11071 [T08l . The normalization of 
PJFry follows that chosen in the scalar library. For QCDLoop, C(e) = r(l - 2e)/[r(l + e)r'^(l - e)]. 



• Small ()4 are avoided using asymptotic expansions where appropriate 

• Cache system for tensor coefficients and signed minors 

• Interfaces for C, C++, FORTRAN and Mathematica 

• Uses QCDLoop II1091I1101 or OneLOop [ill 111 for 4-dim scalar integrals 

• Available from the project webpagelhttps : //github. com/Vavu/P JFry/l lll04[ |105i 
The installation of PJFry may be performed following the instructions given at the project webpage. 
The project subdirectories are 

./src - the library source code 

./mlink - the MathLink interface 

./examples - the FORTRAN examples of library use, built with make check 

A build on Unix/Linux and similar systems is done in a standard way by sequential performing 
./configure, make, make install. See the INSTALL file for a detailed description of the ./configure op- 
tions. 

The functions for tensor coefficients for up to rank R = 5 pentagon integrals are declared in the 
Mathematica interface: 

In:= Names ["PJFry '*"] 

Out= {AOvO, BOvO, BOvl, B0v2, COvO, COvl, C0v2, C0v3, \ 
DOvO, DOvl, D0v2, D0v3, D0v4, EOvO, EOvl, E0v2, \ 
E0v3, E0v4, E0v5, GetMu2, SetMu2 } 

The C++ and Fortran interface syntax is very close to that of e.g. LoopTools/FF: 

E0v3 [i, j, k,pls,p2s,p3s,p4s,p5s, sl2, s23, s34, s45, sl5,mls,m2s,m3s,m4s,m5s, ep=0] 

where J3 
i , j , k are indices of the tensor coefficient (0 < i < j < A; < n), 
pls,p2s, . . . are squared external masses pf, 
sl2, s2 3, . . . are Mandelstam invariants {pi +pj)'^, 
ml s , m2 s , . . . are squared internal masses mf, 
ep= , - 1 , - 2 selects the coefficient of the e-expansion. 

The average evaluation time per phase-space point on a 2 GHz Core 2 laptop for the evaluation of 
all 8 1 rank 5 tensor form-factors amounts to 2 ms. 

A numerical example is shown, for a configuration as in figure[TJ in figures [2]and[3]for a five-point 
rank R = A tensor coefficient in a region, where one of the 4-point sub-Gram determinants vanishes [at 

x = 0]: 

^3333(0, 0, -6xl0^(j; + 1), 0, 0, 10^ -3.5xl0^ 2x10'', -4x10'^, L5xl0^ 0, 6550, 0, 0, 8315) 

The red curve is produced with standard PJFry, and the blue one with Passarino-Veltman [PV] reduction 
IIII2I : we mention that for the case treated here (x — )■ 0), the PV reduction is no standard option. Our 
expansion in terms of higher dimensional scalar 3-point functions in case of vanishing 4-point sub- 
Gram determinants uses only functions I^'^^ II105II . These are tensor coefficients of the pure g'^'^ type 
IIII3I . and so our method is different from others with a mixed numerical approach HI 1411 or with use of 
additional tensor coefficients II115II . 

Tensor reduction by PJFry is used as one option of the GoSam package |[T2l . An older version of 
the algorithm, as described in [1161 , has been implemented independently in |fTTl|. 



''One has to carefully control accuracies; e.g. the on-shell conditions for massless particles have to be fulfilled with a 
numerical precision expected by the scalar functions library in use; for QCDLoop this means on default at least 10 digits. 
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Fig. 1 : Momenta definitions for PJFry. 




1.2 POTENTIAL UPGRADES 

1.21 Tensor reduction for higher-point functions 

So far, PJFry is foreseen for 5-point functions and simpler ones. The extension to 6-point functions is 
known from e.g. 111141 11151 II 1611 . In II107II we solve analytically generalized recursions for n > 6, 
derived in 111 141 : 






1M2---Mfl 



5^cr(n)c-r"'^ 



(2) 



r=l 



where in I^i^i^ the line r is scratched. The coefficients for 6-point functions are: 



'''^''4±0.<'' 



s = 0...6, 



(3) 



/Or 



where the qi aie chords, and (^J^ etc. are signed minors with arbitrary s. For the 7-point and 8-point 
functions, we found several representations, among them 



Cf'''i7) = Yl 



1 I sti 



1=1 Vst/7 






(4) 



and 



The upper indices s, t and u stand for the redundancy of the solutions and can be freely chosen. 



(5) 



1.22 Evaluation of contracted tensor integrals using sums over signed minors 

The contraction of a tensor integral with chords may be written as a sum over basic scalar integrals (at a 
stage where they are free of tensor coefficient indices), multiplied by (multiple) sums over chords times 
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Fig. 2: Absolute accuracy of -Essas in the region of vanishing sub-Gram determinant. Blue curve: con- 
ventional Passarino-Veltman reduction, red curve: PJFry. 



signed minors. If one may perform these sums algebraically, the method becomes very efficient. And 
this has been systematically worked out in II106L see also II108I . 

We reproduce here two 7 -point examples. 

The rank R = 2,3 integrals become by contraction 



r,t=l 

7 



rt 
5 ) 



Qa,ij.qb,uQc,X ^7 



^uX 



E''" 



abc,rtu jrtu 



(6) 

(V) 



r,t,u=l 



where I^^ and I4*" are scalar 5- and 4-point functions, arising from the 7-point function by scratching 
lines r, t, . . . In the general case, we have at this stage higher-dimensional integrals I^"*"^^, n = 2, . . . , 5, 
to be further reduced following the known scheme, if needed. Here, the Ig* have to be expressed by 
4-point functions. 

The expansion coefficients are factorizing here, 



K 



jy-abjrt 
abc,rtu 



j^a,rj^b,rt 

T/'a,,r iy-b,rt r/'C,rtu 



and the sums over signed minors have been performed analytically: 

1 



K" 



{Sa 



6' 



It) 



(8) 
(9) 



(10) 



K 



b.rt 



6 trst\ 

j=l \rs) 7 



U, 



l,stu 



rrs\ 
\rsj; 



is> 



bt 



J7t) 



H7s)7 



{s. 



br 



J7r) 



(11) 



11 



1. X 10"* 

S. X lO"' 



-5. X 10"" 




Fig. 3: Relative accuracy of -Essss in the region of vanishing sub-Gram determinant. At x 
PJFry switched to the asymptotic expansion. 



0.0015, 



K 



a,stu 




Ostu 
Osti 



E: 



2,stu 



(12) 



7 



2 1 V .to y , (^- - ^-) + (olt) , ^'- - ^-) - (o W , ('^^ - '-) - (ot) , (^- - '^-^ 



with 



Yjk = -{qj - Qk) + rUj + m^. 



(13) 



Conventionally, qj = 0. 

The sums may be found in eqns. (A. 15) and (A. 16) of II106II . The s is redundant and fulfils 

s y^ r,b, 7 in K^'^^. In Kq'* " it is s, t, u = 1, . . . 7 with s ^ u,t ^ u. 

ACKNOWLEDGEMENTS 

J.F. thanks DESY for kind hospitality. Work is supported in part by Sonderforschungsbereich/Transregio 
SFB/TRR 9 of DFG "Computergestiitzte Theoretische Teilchenphysik" and European Initial Training 
Network LHCPHENOnet PITN-GA-2010-264564. 

2. THE GOSAM APPROACH TO AUTOMATED ONE-LOOP CALCULATIONS 

Abstract 

We describe the GoS AM framework for the automated computation of multi- 
particle scattering amplitudes at the one-loop level. The amplitudes are gen- 
erated explicitly in terms of Feynman diagrams, and can be evaluated using 
either d-dimensional reduction at the integrand level or tensor decomposition. 
GoSam can be used to compute one-loop QCD and EW corrections to Stan- 
dard Model processes, and it is ready to link generic model files for theories 
Beyond the Standard Model. 

''Contributed by: G. Cullen, N. Greiner, G. Heinrich, G. Luisoni, P. Mastrolia, G. Ossola, T. Reiter, F. Tramontano 
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2.1 Introduction and General Motivations 

In the last few years we observed major advances in the direction of constructing packages for fully 
automated one-loop calculations, which profited from the new developments in the field of NLO QCD 
calculations ifTTlfTSJI . The continuous improvement of the techniques for one-loop computations led to 
important new results for processes with many particles lEB |70l [fnl |7lll5l|48l|l9ll50l|5l|62l|62|23 
[TT81 1251 1561 1571 12X11221 1201 [T91. 

Very advanced calculations have been performed with improved algebraic reduction methods 
based on Feynman-diagrammatic algorithms, as well as with new numerical techniques based on the 
idea of reconstructing one-loop amplitudes from their unitarity cuts. These theoretical developments 
found an ideal counterpart in the integrand-level reduction algorithm, that allows for the reduction of any 
scattering amplitudes to scalar master integrals, simply by evaluating numerically the integrand at given 
fixed values of the integration momentum. In both scenarios, to tackle the increase in the complexity 
and in the number of diagrams that contribute to the amplitudes, automation becomes indispensable for 
processes with many external legs. 

The purpose of the present document is to illustrate the main features of GoSam llT2ll . a new 
framework which allows the automated calculation of one-loop scattering amplitudes for multi-particle 
processes. This approach combines the automated algebraic generation of d-dimensional unintegrated 
amplitudes obtained via Feynman diagrams, with the numerical integrand-level reduction provided by 
the d-dimensional extension |[TT9l [T20l l6ll of the GPP integrand-level reduction method ifHTl IT221 [T23l 
and improved tensorial techniques Ill24[|125ll . 

The integrands of the one-loop amplitudes are generated via Feynman diagrams, using 
QGRAF II126II . FORM 1.127.1 . spinney [.128.1 and haggles 111291 . The only task required from the 
user is the preparation of an "input card" to start the generation of the source code and its compilation, 
without having to worry about internal details of the code generation. The individual program tasks are 
efficiently managed by python scripts. Concerning the reduction, the program offers the possibility to 
use either the d-dimensional extension of the OPP method, as implemented in SAMURAI [6J, or tensor 



reduction as implemented in Golem95C II130I 11311 interfaced through tensorial reconstruction at the 



integrand level II124II . 

2.2 Algebraic approach to Automation 

There are several approaches to the automated computation of multi-particle scattering amplitudes at the 
one-loop level, which provide different recipes for the construction of multi-purpose tools. The goal of 
such tools is the evaluation of one-loop scattering amplitudes for any choice of particles in the initial and 
final states, in a fully automated manner. 

In the algebraic approach to multi-purpose automation, amplitudes can be generated from Feyn- 
man diagrams by employing tools for algebraic manipulation: already some time ago, the idea of au- 
tomating NLO calculations has been pursued by public programs like FeynArts [132] and QGRAF [[12611 
for diagram generation and FormCalc/LoopTools [I1331 and GRACE |l3l for the automated calculation of 
NLO corrections, primarily in the electroweak sector. 

When we combine the algebraic generation with the integrand-level reduction, the set of algebraic 
operations required are quite different with respect to a traditional tensorial reduction. Since the target 
is to provide the numerical value of the numerator function at given values of integration momentum, 
we should aim at expressions for the unintegrated numerator that are easily evaluated numerically. To 
achieve this task, for example, expressions in terms of spinor products are particularly convenient. 

We briefly list here some of the advantages of the "algebraic approach": i) the algebraic genera- 
tion is executed separately from the numerical reduction, therefore algebraic manipulations are possible 
before starting the numerical integration; CPU-time can be spent, once for all at the beginning of the cal- 
culation, to optimize and reduce the size of the integrands that will be evaluated numerically several times 
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later on during the reduction; ii) the algebraic method allows us to group sets of diagrams and cache all 
factors that do not depend on the integration momentum; iii) easy access to sub-parts of the computation; 
subsets of diagrams can be easily added or removed from the final results, simplifying comparisons and 
tests; iv) computer algebra can be performed in dimension d using alternative regularization schemes; v) 
the choice between different reduction algorithms can be performed at run-time, providing flexibility and 
internal cross-checks. In the next section we will briefly illustrate how these properties are used within 
GoSam. 

Important progress in a similar direction has been also recently achieved by means of FeynArts, 
FormCalc and LoopTools 111341 l2i to provide amplitudes that can be processed using the integrand-level 
reduction provided by CutTools II135II and/or SAMURAI [6] or with the traditional Passarino-Veltman 
reduction Iill2il. 



2.3 A brief introduction to GoSam 

GoSam produces in a fully automated way all the code required to perform the calculation of virtual 
one-loop amplitudes. The only task left to the user is the preparation of an "input card" which contains 
all the information related to the particular process namely initial and final particles, model, helicities, 
selection rules to exclude particular sets of diagrams, regularization scheme. The card also contains flags 
to select the preferred reduction methods and some optimization flags to adapt the diagram generation to 
the needs of the user. 

There are three main steps that GoSam follows in order to prepare the code for the calculation: 
the generation of diagrams that contribute to the process, the optimization and algebraic manipulation to 
simplify their expressions, and the writing of a FORTRAN code ready to be used within a phase-space 
integration. It is important to remember that these steps will only be performed once. After the code is 
generated, the reduction of unintegrated amplitudes to linear combinations of scalar (master) integrals 
is fully embedded in the process and can be performed with different options, all available at run-time. 
Only the last part, namely the reduction and evaluation of master integrals, will be repeated for all the 
different phase-space points that contribute to the cross-section. 

2.31 Diagram Generation 

For the diagram generation both at tree level and one-loop level we employ QGRAF II 12611 which we 
complemented by adding another filter over diagrams implemented in Python. This gives several ad- 
vantages since it increases the ability of the code to distinguish certain classes of diagrams and group 
them according to the sets of their propagators, in order to fully optimize the reduction. 

At this stage GoSam generates three sets of output files: an expression for each diagram for 



FORM III27I . Python code for drawing each diagram, and Python code for computing the properties 
of the diagram. Information about the model is either read from the built-in Standard Model of QGRAF or 
can be defined by the user by means of LanHEP II 1361 or an Universal FeynRules Output (UFO) file II 1371 

The Python program automatically performs several operations: diagrams whose color factor 
turns out to be zero are dropped; the number of propagators containing the loop momentum, the tensor 
rank and the kinematic invariants of the associated loop integral are computed; diagrams with a vanishing 
loop integral associated are detected and flagged for the diagram selection; all propagators and vertices 
are classified for the diagram selection; diagrams containing massive quark self-energy insertions or 
closed massless quark loops are specially flagged. 

During this phase, GoSam also generates a I^T^X file which contains, among other useful infor- 
mation of the generated process, the drawings of all contributing diagrams. To achieve this task, we use 
our own implementation of the algorithms described in Ref. II138II and Axodraw 111391 to actually draw 
the diagrams. 
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2.32 Lorentz Algebra 

Concerning the algebraic operations performed by GoSam to render the integral suitable for efficient 
numerical evaluation, one of the primary goals is to split the (4 — 2e) dimensional algebra into strictly 
four-dimensional objects and symbols representing the higher-dimensional remainder. All external vec- 
tors (momenta and polarisation vectors) are kept in four dimensions; internal vectors, however, are kept 
in the d-dimensional vector space. 

We adopt the conventions used in II128I1 . where k denotes the four dimensional projection of an in 
general d dimensional vector k. The {d — 4) dimensional orthogonal projection is denoted as k. For the 
integration momentum q we introduce in addition the symbol /i^ = —q^, such that 

q^ = f + f = f- //^. (14) 

We also introduce suitable projectors by splitting the metric tensor 

9^'' = g^'' + r'', 9'^"9.p = 0, gi: = i, r^ = d-A. (15) 

Once all propagators and all vertices have been replaced by their corresponding expressions, ac- 
cording to the model file, all vector-Uke quantities and metric tensors are split into their four-dimensional 
and their orthogonal part. As we use the 't Hooft algebra, 75 is defined as a purely four-dimensional ob- 
ject, 75 = ie^yp(j^^^^^^P^" . By applying the usual anti-commutation relation for Dirac matrices we can 
always separate the four-dimensional and {d — 4) -dimensional parts of Dirac traces. 

While the {d — 4) -dimensional traces are reduced completely to products of {d — 4) -dimensional 
metric tensors, the four-dimensional part, which will be reduced numerically, is treated such that the 
number of terms in the resulting expression is kept as small as possible. 

2.33 Treatment of rational terms R2 

Instead of relying on the construction of R2 from specialized Feynman rules II123I 11401 11411 [T421I143L 
we can generate the R2 part along with all other contribution using automated algebraic manipulations. 
The code offers the option between the implicit and explicit construction of the R2 terms. The implicit 
construction treats the 4— and {d — 4) dimensional numerators on equal grounds: they are generated 
algebraically and reduced numerically. The explicit construction of R2 is based on the fact that the 
(d — 4) dimensional part of the numerator function contains expressions for the corresponding integrals 
that are relatively simple and known explicitly. Therefore, after separating it using the algebraic ma- 
nipulation described before, the {d — 4) dimensional part is computed analytically whereas the purely 
four-dimensional part is passed to the numerical reduction. This approach also allows for an efficient 
calculation of the part R2 alone. 

2.34 Reduction to scalar (master) integrals 

GoSam allows to choose at run-time (i.e. without re-generating the code) the preferred method of 
reduction. Available options include the integral-level d-dimensional reduction, as implemented in 
SAMURAI, or traditional tensor reduction as implemented in Golem95C interfaced through tensorial 
reconstruction at the integrand level, or a combination of both. Concerning the scalar (tensorial) inte- 
grals, GoSam allows to choose among a variety of options, including QCDLoop II109L OneLoop Ullll . 
Golem95C 111301 . plus the recently added P JFRY 111081 Among these codes, OneLoop and Golem95C 
also fully support complex masses. 

2.4 Installation and Usage 

GoSam can be used within a standard Linux/Unix environment. In order to work, it requires some 
programs to be installed on the system: these include a recent version of Python (version > 2.6), Java 
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(> 1.5), a Fortran95 compiler, FORM (version > 3.3), and QGRAF. Further, at least one of the libraries 
SAMURAI or Golem95C needs to be present at the time the code is compiled. 

To facilitate this task, we have prepared a package containing SAMURAI and Golem95C to- 
gether with the libraries for the integrals: OneLOop, QCDLoop, and FF. The package, which is called 
gosam-contrib-1 . . tar . gz is available for download from 

http: //projects .hepf orge . org/gosam/. 

The installation procedure is facilitated by the use ofAutotools. 

The user can download the GoSam code either as a tar-ball or from the subversion repository 
at http://projects.hepforge.org/gosam/. The installation of GoSam is controlled by 
Python Distutils and can be performed by simply running the command 

python setup. py install 

In order to generate the code for a process, the user needs to prepare an input file (process card) which 
contains: 

- process specific information, such as a list of initial and final state particles, their helicities (op- 
tional) and the order of the coupling constants; 

- scheme specific information, such as the regularisation and renormalisation schemes, the underly- 
ing model, masses and widths which are set to zero; 

- system specific information, such as paths to programs and libraries or compiler options; 

- optional information for optimisations within the code generation. 

Assuming that the process card is called myprocess . in, the generation of the code can be started 
by simply running the command gosam.py myprocess . in. All further steps are controlled by 
makefiles which are automatically generated by GoSam: the command make compile generates 
the source code and compiles all files relevant for the production of the matrix element. The code can be 
tested with the program t e s t . f 9 (located in the subdirectory mat r i x) which provides, for a random 
phase-space point, the tree-level LO matrix element and the NLO result for the finite part, single and 
double poles. Examples of process cards for a selection of benchmark processes are provided with the 
main distubution. 

For more details about the usage and installation of GoS AM, we refer the user to a more technical 
presentation II 14411 or to the original pubhcation |[T2l and the user manual which accompanies the code. 

2.5 Examples of Applications 

The BLHA interface 1 145] allows to link GoSam to a general Monte Carlo event generator, which is 
responsible for supplying the missing ingredients for a complete NLO calculation of a physical cross 
section. Among those, SHERPA 111461 offers the possibility to compute the LO cross section and the 
real corrections with both the subtraction terms and the corresponding integrated counterparts [|147l . 
Using the BLHA interface, we linked GoSam with SHERPA to compute the physical cross section for 



W + 1-jet at NLO, which is described in Section 18 



The codes produced by GoS AM have been tested on several processes of increasing complexity, 
some of which are shown in Table 1. The full list of processes produced by GoSam and compared to 
the literature where available is given in Ref. [12J. 

As an example of the usage of GoSam with a model file different from the Standard Model, we 
calculated the QCD corrections to neutralino pair production in the MSSM. The model file has been 
imported using the UFO interface. In this calculation, we combined the one-loop amplitude with the real 
radiation corrections to obtain results for differential cross sections. For the infrared subtraction terms 
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e+e — > uu 


1148,! 


e+e- -^ ti 


I.149l|150|, own analytic calculation 


uu — ^ dd 


imiEi 


99~> gg 


|T5l 


gg-^ gz 


1153] 


bg^Hb 


|154]|8l 


77 — > 77 (W loop) 


1155] 


77 — ;► 7777 (fermion loop) 
pp — >■ ti 


1,156] 


181. MCFM 1157111581 


pp -^ W^ j (QCD corr.) 
pp -^ W^ j (EW corr.) 


1157111581 


for IR poles: II65II159II 


pp^W^t 


1157111581 


pp -^ W^ jj 


1157,158] 


pp — > W^bb (massive b) 


I.157..1581 


e+e- -^ e+e"7 (QED) 


CSol 


pp^ Htt 


m 


pp^ Ztt 


\m 


pp — > W+W+jj 


I56lv3] 


pp -> bbbb 


162,11] 


pp -> W+W^bb 


i.8..i6n 


pp — ^ tibb 


iiiiiD 


ud — T' W+ggg 


HSU 



Table 3: Some of the processes computed and checked with GoS AM 



we employed MadDipole 111621 . while the real emission part is calculated using MadGraph/MadEvent 



II163I . The virtual matrix element is renormalized in the MS scheme, while massive particles are treated 
in the on-shell scheme. The renormalization terms specific to the massive MSSM particles have been 
added manually. In Fig|4| we show the differential cross section for the moo invariant mass, where 
we employed a jet veto to suppress large contributions from the channel qg — ;• XiXi9 which opens 
up at order Q^a^, but for large p!^ belongs to the distinct process of neutralino pair plus one hard jet 
production at leading order. 
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Fig. 4: Comparison of the NLO and LO fn o o distributions for the process pp — )• xixi with a jet veto 
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on jets with pj, > 20 GeV and r/ < 4.5. The band gives the dependence of the result on fi = fip = hr 
between fio/2 and 2/io. We choose /io = mz- 
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Conclusions and Outlook 

Several groups are currently working at the development of automated multi-purpose tools for one-loop 
calculations. For quite a long time, tree-level calculation have been fully automated and included in 
flexible multi -process tools 111641 |165i The level of automation achieved by one-loop calculations is 
suggesting the possibility of a similar success also at the next-lo-leading order. The target is to build 
efficient and flexible NLO programs which can be used to tackle the increasing need of precision required 
by the experimental collaborations. 

GoSam is a flexible and broadly applicable tool for the fully automated evaluation of one-loop 
scattering amplitudes. In this approach, scattering amplitudes are generated in terms of Feynman dia- 
grams and their reduction to master integrals can be performed in several ways, which can be selected at 
run-time. GoSam can be used to calculate one-loop corrections both in QCD and electro-weak theory 
and offers the flexibility to link general model files for theories Beyond the Standard Model. The code 
performed well in reproducing a wide range of examples and we are looking forward to tackle more 
challenging calculations and interfacing with other existing tools in the near future. 
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3. AUTOMATION AND NUMERICAL LOOP INTEGRATION 

3.1 INTRODUCTION 

Numerical methods are nowadays routinely used in fully differential fixed-order perturbative calculations 
for the integration over the phase-space of the final-state particles. The use of numerical methods for 
the phase-space integration allows the flexibility to compute any infrared-safe observable for a given 
process within a single numerical program. It is thus natural to investigate if numerical methods can also 
be apphed for the loop integration in the virtual corrections lfT66l [TFTl 11681 [T69l UTOI [TtTI 17721 [T73l 
I174[ 11751 11761 . A major breakthrough was achieved recently by showing that the numerical method is 
compatible in efficency with the commonly used approaches based on cut techniques and generalised 
unitarity or on Feynman graphs |J70l |5D 122112211201 |52l UTTl |56l|5l EOlilHl UTTl \m\ JH EHl E 113. 
The implementation of the numerical method for the loop integration is process-independent and offers 
therefore the flexibility to compute several processes within one numerical program. We discuss the main 
principles of the numerical method for the loop integration at one-loop. In addition we give an outlook 
towards higher loops. 

3.2 THE SUBTRACTION METHOD FOR THE LOOP INTEGRATION 

The contributions to an infrared-safe n-jet observable observable O at next-to-leading order are given by 

(O)^^^ = f On+ida^ + f Onda^ + I Onda^' . (16) 

n+l n n 

'^Contributed by: S. Weinzierl 
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Here a rather condensed notation is used, da^ denotes the real emission contribution, whose matrix 
elements are given by the square of the Born amplitudes with (n + 3) partons jyl^^lgp. da^ denotes the 
virtual contribution, whose matrix elements are given by the interference term of the one-loop and Born 
amplitude Re {An^2^n+2) ^^'^ '^^^ denotes a coUinear subtraction term, which subtracts the initial state 
coUinear singularities. Each term is separately divergent and only their sum is finite. 

The subtraction method is widely used to render the real emission part of a NLO calculation 
suitable for a numerical Monte Carlo integration. One adds and subtracts a suitably chosen piece to be 
able to perform the phase-space integrations by Monte Carlo methods: 

l^O)NLO ^ f (^On+ida^ - Onda"^) + / j Onda^ + Onda^ + 0„ I da^] . (17) 

n+l n \ 1 / 

The first term (^On+ida^ — Onda"^^ is by construction integrable over the (n + l)-particle phase-space 
and can be evaluated numerically. The result of the integration of the subtraction term over the unresolved 
one-parton phase-space is written in a compact notation as 

da^ + [ da^ = (I + K + P) <g) da^ . (18) 



The notation indicates that colour correlations due to the colour charge operators Tj still remain. The 
terms with the insertion operators K and P pose no problem for a numerical evaluation. The term l®da^ 
lives on the phase-space of the n-parton configuration and has the appropriate singularity structure to 
cancel the infrared divergences coming from the one-loop amplitude. Therefore da^ + 1 ® da^ is 
infrared finite. 

We extend this subtraction method to the virtual part such that we can evaluate the one-loop in- 
tegral of the one-loop amplitude numerically. The renormalised one-loop amplitude A^^' is related to 
the bare amphtude Ay^J^^.^ by A'^^^ = A\^^^^ + -4,^^, where Aqj^ denotes the ultraviolet counterterm from 
renormalisation. The bare amplitude involves the loop integration 

(1) ^ f^!^Ai) n9) 

where ^^arc denotes the integrand of the bare one-loop amplitude. We introduce subtraction terms which 
match locally the singular behaviour of the bare integrand: 



i(i) , ;i(i) _ I ^ fr.W r.W r.W r.W\ , f aW , aW , aW , ;i(i) 
(2vr) 



a(^) <a(')- / ^ "^ G^M _ a^'> - G^'> - G^'' ] + A^'' + A^'> + A^'> + A'-'' ] (20) 



Analogous to GI;^,.^, the integrands of the subtraction terms Ax are denoted by ^^i , where x is equal to 
soft, coll or UV. The expression in the first bracket is finite and can therefore be integrated numerically in 
four dimensions. The integrated subtraction terms in the second bracket are easily calculated analytically 
in D dimensions. The result can be written as 

2Re^(0)(^g^ + ^;^;, + 4oli + -4uv)*O„#„ = L®d(7^. (21) 

The insertion operator L contains the explicit poles in the dimensional regularisation parameter related 
to the infrared singularities of the one-loop amplitude. These poles cancel when combined with the 
insertion operator I: 

{I + 'L)(g)da^ = finite. (22) 

19 



The operator L contains, as does the operator I, colour correlations due to soft gluons. In analogy to the 
one-loop amplitude we can write da^ = da ct + J it lo ^'^barc ^^^ ^^^"^ ^^^ NLO contributions reads 

I^OfLO ^ (23) 

/ [On+ida^ - Onda^) + /" 0„ (dal^^^ - da^''^ + / 0„ (I + L + K + P) da^ . 

n+l n+loop n 

In a condensed notation this reads 

(O)^^^ = (0>ir + (O),^,^?^^ + (O)Sion. (24) 

Every single term is finite and can be evaluated numerically. 

3.3 THE SUBTRACTION TERMS 

Amplitudes in QCD may be decomposed into group-theoretical factors (carrying the colour structures) 
multiplied by kinematic factors called partial amplitudes. At the loop level partial amplitudes may further 
be decomposed into primitive amplitudes. It is simpler to work with primitive one-loop amplitudes 
instead of a full one-loop amplitude. Our method exploits the fact that primitive one-loop amplitudes 
have a fixed cyclic ordering of the external legs and that they are gauge-invariant. The first point ensures 
that there are at maximum n different loop propagators in the problem, where n is the number of external 
legs, while the second property of gauge invariance is crucial for the proof of the method. We therefore 
consider in the following just a single primitive one-loop amplitude, which we denote by A^^\ while 
keeping in mind that the full one-loop amplitude is just the sum of several primitive amplitudes multiplied 
by colour structures. We label the external momenta clockwise by pi, p2, ..., Pn and define qi = pi + 
P2 + ... + Pi, ki = k — Qi. We can write the bare primitive one-loop amplitude in Feynman gauge as 

\le = J j^:;^G,l.^, G,l^ = P{k) H ^2 _ ^2 ^ ^5 • (25) 

^bare ^^ ^^^ integrand of the bare one-loop amplitude. P{k) is a polynomial in the loop momentum k. 
The +i(^-prescription instructs us to deform - if possible - the integration contour into the complex plane 
to avoid the poles at kf = mf. If a deformation close to a pole is not possible, we say that the contour 
is pinched. If we restrict ourselves to non-exceptional external momenta, then the divergences of the 
one-loop amplitude related to a pinched contour are either due to soft or coUinear partons in the loop. 
These divergences are regulated within dimensional regularisation by setting the number of space-time 
dimensions equal to D = A — 2e. A primitive amplitude which has soft or coUinear divergences must 
have at least one loop propagator which corresponds to a gluon. An amplitude which just consists of a 
closed fermion loop does not have any infrared divergences. We denote by Ig the set of indices i, for 
which the propagator i in the loop corresponds to a gluon. The soft and coUinear subtraction terms for 
massless QCD read |fT69l 



'Sjgijv{k]_i,k'^) Sj+iguy{kj, kj^^) 






+ 



1,2 h.2 ' ^,2ir.2 



Af , (26) 



where Sj = 1 if the external line j corresponds to a quark and Sj = 1/2 if it corresponds to a gluon. 
The function g\]^J ensures that the integration over the loop momentum is ultraviolet finite. Integrating 
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the soft and the coUinear part we obtain 






d^A; ^(1) a, e^T^ v-.o , a x2//"L 



is ^9 

S'e = (47r)^e~^'^^ is the typical volume factor of dimensional regularisation, 7^; is Euler's constant and 
/i is the renormalisation scale. 

The ultraviolet subtraction terms correspond to propagator and vertex corrections. The subtraction 
terms are obtained by expanding the relevant loop propagators around a new ultraviolet propagator (A;^ — 
/iyy)"^, where k = k — Q: For a single propagator we have 

1 _ 1 2k-{p-Q) {p-Qf + i^ly [2k-{p-Q)\' ( 1 



(A:-pf k^-^^l^ {k'-f^lyf C^'-f^lvf (k'-f^lyf VI^P 

We can always add finite terms to the subtraction terms. For the ultraviolet subtraction terms we choose 
the finite terms such that the finite parts of the integrated ultraviolet subtraction terms are independent of 
Q and proportional to the pole part, with the same constant of proportionality for all ultraviolet subtrac- 
tion terms. This ensures that the sum of all integrated UV subtraction terms is again proportional to a 
tree-level amplitude II167II . 

3.4 CONTOUR DEFORMATION 

Having a complete list of ultraviolet and infrared subtraction terms at hand, we can ensure that the inte- 
gration over the loop momentum gives a finite result and can therefore be performed in four dimensions. 
However, this does not yet imply that we can safely integrate each of the four components of the loop 
momentum fc^ from minus infinity to plus infinity along the real axis. There is still the possibility that 
some of the loop propagators go on-shell for real values of the loop momentum. If the contour is not 
pinched this is harmless, as we may escape into the complex plane in a direction indicated by Feynman's 
+i5-prescription. However, it implies that the integration should be done over a region of real dimension 
4 in the complex space C^. Let us consider an integral corresponding to a primitive one-loop amplitude 
with n propagators minus the appropriate IR- and UV-subtraction terms: 

where P{k) is a polynomial of the loop momentum k^ and the integration is over a complex contour in 
order to avoid whenever possible the poles of the propagators. We set k^^ = k'^ + iK^^{k), where A;^ is 
real II170II . After this deformation our integral equals 



d'^k 

(2^ 



^k^' 



dk^ 



^(^(^))np 2 Y^W^v. • (29) 

^^ k'i — m^ — K^ + 2ikj • K 
j=l ] 3 •> 



To match Feynman's +f (5-prescription we have to construct the deformation vector k such that 

k]-m] = ^ -^ kj- K> 0. (30) 

We remark that the numerical stability of the Monte Carlo integration depends strongly on the definition 
of the deformation vector k. 
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3.5 NLO results for n-jets in electron-positron annihilation 

We have calculated results for jet observables in electron-positron annihilation, where the jets are defined 
by the Durham jet algorithm [il66il . The cross section for n jets normalised to the LO cross section for 
e+e" — )■ hadrons reads 

'2 / / w n-1 



a. 



n—jet 



(^) 



asifJ.) 



Anifl) 



asifi) 



One can expand the perturbative coefficient An and Bn in l/Nc- 
2 



Bn{li) + 0{al 



(31) 



/in 



Nr 



An,\c + O 



Br, 



Nr 



n-1 



Bn,\c + O 



1 



We calculate the leading order coefficient j4„_ic and the next-to-leading order coefficient Bn^yc forn < 7 
at the renormalisation scale /x equal to the centre-of-mass energy. The centre-of-mass energy is taken to 
be equal to the mass of the Z-boson. The scale variation can be restored from the renormalisation group 
equation. The calculation is done with five massless quark flavours. Fig. |5] shows the comparison of our 
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Fig. 5: Comparison of the NLO corrections to the two-, three- and four-jet rate between the numerical 
calculation and an analytic calculation. The error bars from the Monte Carlo integration are shown and 
are almost invisible. 



numerical approach with the well-known results for two, three and four jets Ill79[ll80lll81 1. We observe 
an excellent agreement. The results for five, six and seven jets for the jet parameter ycut = 0.0006 are 



iVi 
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^5,ic = (2.4764 ± 0.0002) • 10^ 
^6,ic = (2.874 ± 0.002) •10^ 



" A7,ic = (2.49 ± 0.08) • 10^ 



-S. 



5,1c 



16 
^B6,ic = (3.88 ± 0.18) -10^ 



;i.84±0.15) -10^ 



(5.4±0.3)-10^ 



(32) 



3.6 FIRST STEPS TOWARDS NNLO 

An NNLO calculation requires among other things also the calculation of the one-loop amplitude 
squared. The expansion in the dimensional regularisation parameter e of the one-loop amplitude starts at 
order (—2) one would naively expect that up to order e° the C(t)- and C'(e^)-terms of the one-loop am- 
plitude are needed for an NNLO calculation. However, it is by no means obvious how the approaches for 
one-loop amplitudes based on unitarity or the numerical method can be extended to include the higher- 
order terms in the e-expansion. It turns out that the computation of these higher-order terms can be 
avoided, provided a method is known to compute the finite two-loop remainder function. The one- and 
two-loop amplitudes can be written as II182II 



2(1)^(0) +^(1) 



minimal' 



(z(2) - Z(i)z(i)) ^(0) + Z(i)^(i) + T\ 



(2) 
minimal' 



(33) 
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where the operators iS-^^ and Z^^^ contain all the infrared poles and -T^minimai ^^^ -^minimal 
remainders. Here we used the convention that the operators Z^^) and Z^^^ contain only pole terms, but 
no terms of order e^ with /c > 0. This corresponds to a minimal scheme. The operators Z^^^ and Z^^-' are 
well-known. At NNLO it is sufficient to know the e°-terms of -^minimal ^"^^ -^minimal' ^^'^ ^^' ^^ e^-terms 
of A'^^^ or -7\Lj„i„„i are not required II183II . 
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4. TOWARDS THE AUTOMATION OF ONE-LOOP AMPLITUDES 

Abstract 

A program is presented that computes one-loop amplitudes automatically for 
processes with up to 6 external particles based on the Feynman-diagram ap- 
proach. Additionally, universal one-loop building blocks, which can be used 
to compute several processes at NLO QCD are calculated. 

4.1 INTRODUCTION 

The calculation of processes with multi-particle final states beyond the leading order approximation has 
been an active field of research during the last years as a consequence of the demand of high accuracy for 
signal and background processes at the LHC. A next-to-leading (NLO) calculation consists of virtual and 
real radiation processes which are infrared divergent (IR) separately and can be computed numerically 
only after extracting the divergences of the real radiation contributions. The one-loop virtual calculation 
for multiple particles poseses a challenge of complexity not only due to the large number of contributing 
diagrams, but also concerning the stability of the numerical code to evaluate them. In the last years, an 
enormous progress has been achieved applying new techniques and using traditional Feynman-diagram 
approach, leading to new NLO predictions. 

Due to the large number of processes of potential interest at the LHC, the scientific community 
has worked in the automation of the NLO calculations. The automation of the real contributions in- 
cluding their infrared subtraction terms has been successfully implemented in several packages and the 
automation of the virtual corrections, which is a harder problem, is currently being achieved in several 
programs (see II 18411 and references therein). 

In Ref. II185L the early stage of a program, in the framework of Mathematica 111 861 and Feyn- 
Calc 111871 . to compute automatically one-loop amplitudes based on traditional Feynman-diagram tech- 
niques and involving up to 2 — ;■ 4 processes was presented. This program will become publicly available 



in the future. The method used is described in Section 4.2 In Section 4.3 we present a set of univer- 
sal one-loop building blocks that has been used to compute recently several processes included in the 
VBFNLO package ll42ll4Tl . 

4.2 TOWARDS AN AUTOMATIC ONE-LOOP AMPLITUDE GENERATOR 

The program above mentioned automatically simplifies a set of amplitudes up to Hexagons of rank 
5. The result is given in terms of scalar and tensor integrals following the Passarino-Veltman conven- 
tion Illl21ll85ll . spinor chains, polarization vectors and model parameters. The simplified expression is 
written automatically to FORTRAN routines. For massless propagators, the amplitudes can be evaluated 
also in Mathematica with unlimited precision, which is used for testing purposes. To achieve that, the 
scalar integrals, the tensor reduction formalism to extract the tensor coefficient integrals, and also the 
helicity method described in Ref. 111881 11891 to compute the spinor products have been implemented at 

^Contributed by: F. Campanario 
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the FORTRAN and Mathematica level. For the determination of the tensor integrals up to the box level, 
the Passarino-Veltman tensor reduction formalism [ 1 12] is used applying the LU decomposition method 
to avoid the explicit calculation of inverse Gram matrices by solving a system of linear equations, which 
is a more stable procedure close to singular points. Finally, for singular Gram determinants, special 
tensor reduction routines following Ref. II115I have been implemented, however, the external momenta 
convention (Passarino-like) was used. The impact of these methods is discussed in detail in Ref. (TES). 
For pentagons, in addition to the Passarino-Veltman formalism, the method proposed by Denner and 
Dittmaier Illl5[|190ll . applied also to hexagons, has been implemented. For that, the recursion relations 
of Ref. II115II in terms of the Passarino-Veltman external momenta convention have been re-derived. This 
last method is used for the numerical implementation at the FORTRAN level. 

The Mathematica function does several algebraic manipulations that are summarized as follows: 

• Simultaneous extraction of rational terms based on Dirac algebra manipulations and cancelation 
of scalar products against propagators. 

• Reduction to a minimal basis of tensor and scalar integrals. 

• Reduction to a minimal basis of spinor chains. 

• The use of Chisholm identities, which are only valid in 4 dimensions, for the contraction of Lorentz 
indices among different spinor chains is applied, if selected. 

• Factorization of loop dependent and independent factors (Useful to perform gauge tests. Ward 
identities or the re-evaluation of the amplitudes for different helicity polarization of gluons and 
fermions at a lower CPU cost). 

As an example of the notation used, the following Hexagon diagram is used. This is written as 
follows: 



Mv,v,v,v„r = gr^fgr^fgr^f9r^fj^cri''^''^''^Ml\ (34) 



Vi V2 Vs V, 
where go is the strong unrenormalized coupling, C^^ ^ ^ '' is a color diagram dependent factor, e.g, 
^■y^g-r _ (^x^'j.j^^Cp _ 1/2Ca)- Qt are electroweak couplings and A^r represents the amplitude 
considering generic off-shell vector bosons with color indices ij for a given helicity r. The amplitude 
M.T, omitting color indices, is written in terms of 

Mr = M^=^ + {D- 4)Xf ^, (35) 

where M.^^^ is the amplitude that one would obtain performing the Dirac algebra manipulation in four 
dimensions, Z) = 4, and Ai^^ contains the rational terms and vanishes in Dimensional Reduction (DR.). 
These functions are decomposed in the form: 

^(^=^'^^) = ^SM,,F1,-, (36) 

hi 

where SMj^t- is a basis of Standard Matrix elements corresponding to spinor products describing the 
quark line of Eq. ([34]) which are computed following the helicity method II188[ I189II with a defined 



helicity, r. Flj are complex functions which are further decomposed into dependent and independent 
loop integral parts, 

Fl, = Y, ^iTk (e(Pn) • Pm-, e{p^) ■ e(p,)) • (37) 

l,k 

Tfc is a monomial function at most for each polarization vector e{px), i.e., e{px)'^ or e(pa)^. The first 
possibility, e{px)^, implies that the polarization vector appears in the set of Standard Matrix elements 
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SMj^r- Fi contains kinematic variables (pj • pj), the scalar integrals (Bq, Co, -Do), and the tensor integral 
coefficients {Bij, Cij, Dij, Eij, Fij). Then, the full result is obtained from TW:^^^ and M.^^ using the 
finite and the coefficients of the 1/e" poles of the scalar and tensor coefficient integrals: 

M^='' = M., + '-^ + '-^, {D-A)M^''=M. + '^, (38) 

e e^ e 

where, e.g., M.^ is the finite contribution obtained using the finite pieces of the scalar and tensor coef- 
ficient integrals including the finite contributions from rational terms arising in ultraviolet tensor coeffi- 
cient integrals. 

4.3 UNIVERSAL BUILDING BLOCKS 

Based on the observation that the same one-loop virtual amplitudes appear in many processes (Fig. [6]l, 
we are aiming to collect a basis of universal building blocks, which can be used to compute all of the 
2 — ;■ 4 processes at LHC at the QCD one-loop level (Similar to the philosophy of older versions of 
MADGRAPH II191I calling the HELAS 111921 routines). This methodology of collecting topologies in 
groups has been proved very successful in the program VBFNLO, where for example a boxline routine, 
first line of Fig.|6j is computed and applied to pp — )• VV , pp — )• VVV, pp — )• VVj and EW production 
of pp — >• Vjj andpp — )■ HVjj. 




Vi V2 Vi V2 Vi V2 Vi 




V'll IV2 VI i JVj Vii iV2 Vi 




Vi^ i___ Vi <" i:__ Fi <" ? _ _ _ Vi 



Fig. 6: Boxline contributions appearing in different processes. 

To do that, we use the effective current approach described and applied in Ref s. ll39l l47l [3T1 11931 
I2TI . As illustration, the first diagram of the second raw of Fig. [6]is used. This can be written as, 

^v^_V2VzVA,T = Jv{'^v}■^^^l^^2,T = Mv*v;,t, (39) 

where the color indices have been omitted. Here, Jyl and Jyl represent effective polarization vectors 
in the unitarity gauge for the EW sector including fimte width effects in the scheme of Ref s. 111941 11951 
and propagator factors, e.g., 
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with Fy*, the width of the Vi vector boson, and Iy*y y , the triple vertex, which can also contain the 
leptonic decay of the EW vector bosons including all off-shell effects or BSM physics. In this manner, we 
can then concentrate in computing, instead of Av^V2V3V4,t, the virtual correction to two massive vector 
bosons attached to the quark line, Aiv*v*,T, or equivalently M.fj,^^2,T, where the polarization vectors 
or effective currents have been factored out. In our approach, this basic building block is the so-called 
Boxline, which is computed only once and re-used in different processes. 

We plan to do a classification of all the topologies that appear at 1 loop level for up to 2 — ;• 4 pro- 
cesses and install a library with all the basic one-loop building blocks already computed and simplified. 
This would be an advantage since, for example for qq — )• WW production, up to 24 hexagons for a 
single subprocess would appear, corresponding to the permutations of the vector bosons on the hexagon 



of Eq. 34 In this approach, the amplitude is obtained by calling the same one-loop amplitude 24 times 
with the corresponding ordering of momenta and polarization vectors. We aim towards an automation 
of this procedure, which will result into a faster and shorter final FORTRAN code generation. The spe- 
cific building blocks are collected into groups with specific gauge and IR factorization properties, e.g, 
factorization of the IR divergences against the corresponding born, known behavior under Ward identity 
checks. 

In Fig. |7] we present the topologies that have been computed and tested. In the first line, correc- 
tions to a quark line with the emission of V„ vector bosons in a fixed order are represented for 4 different 
topologies. (The first 2 were explained in detail in Ref. [1851, including their stability behavior). We 
have only depicted the virtual amplitude with the higher complexity for a giving building block, e.g. the 
boxline of Figj6]is obtained from the first diagram with two vector bosons attached, i.e., n = 2 in Vn- The 
first two topologies of the second line are collected by putting together all possible Feynman-diagrams 
with a fixed order of the vector bosons and attaching it to the quark lines in all possible ways. The 
crossing of the fermion lines are treated as independent building blocks and are not depicted. Finally, the 
fermion-loop corrections for a fixed order of vector bosons, Vn, are computed in the last diagram of the 
second line 

The use of modular structure routines, as the above presented, has been proved to be an advantage 
in the program VBFNLO f4T. Wl] since once a structure is computed and checked it can be re-used for 
different processes. For example, using the building blocks of the first and second topology together with 
the fermion-loop diagrams, resuhs at NLO QCD for all VW 1391 gTl EH EH EH El, several Wj CT 
[32llI961[l97l . Hjjj II198II and W'j^j ||2T1 production channels have been computed recently. The last 
one representing the first calculation at this accuracy falling in the category of VW + j production. 
Up to the pentagon level, these building blocks are publicly available as part of the VBFNLO [4211411 
package together with the tensor reduction routines, excluding the routines for small Gram determinants 
which will become available in the future, in addition to the other building blocks. 

4.4 CONCLUSIONS 

A program which automatically evaluates one-loop amplitudes for up to 2 —)^ 4 processes has been pre- 
sented based on the traditional Feynman-diagram approach. The program has been developed in the 
framework of Mathematica and FeynCalc and writes down automatically the simplified expression to 
FORTRAN. Up to the pentagon level and for massless propagators, the code can be evaluated numer- 
ically inside Mathematica with unlimited precision which can be used for testing purposes. For the 
reduction of tensor integrals, we have developed a library that includes expansion for small Gram deter- 
minants. Using the leptonic tensor formalism, we are building a library of universal one-loop building 
blocks, which can be used to compute several processes at NLO QCD. Recently, following this strat- 
egy, we have reported results for all VW IM EH EH ESI 143^. "441 , several Wj EI [3l [Hi [T97l . 
Hjjj 1 198J and Wj^j ||2T1 production channels inside the VBFNLO collaboration. The ultimate goal 
is to generalize the library to compute all of the 2 — )• 4 processes at LHC at the QCD one-loop level, 
similar to the philosophy of older versions of MADGRAPH 119111 calling the HELAS II 19211 routines, 
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V„, n < 4 



Vi V„, n < 2 



Vi 



V„,, n < 2 




Fig. 7: Topologies of universal building one-loop blocks. Only the most complicated diagram of each 
topology is depicted, e.g, the boxline of Figj6]is obtained from the first diagram with two vector bosons 
attached, i.e., Vn, n = 2. 



and deliver a Mathematica package compatible with FeynArts II132L which can be used to compute full 
one-loop amplitudes automatically using the universal building blocks, resulting into a faster and shorter 
code generation. 
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5. THE TWO-LOOP QCD VIRTUAL AMPLITUDE FOR W PAIR PRODUCTION WITH 
FULL MASS dependence!!] 

5.1 INTRODUCTION 

One of the main aims in the Large Hadron Collider (LHC) physics program is undoubtedly the discovery 
(or the exclusion) of the Higgs boson which is responsible for the fermion and gauge boson masses 
and also part of the mechanism of dynamical breaking of the Electroweak (EW) symmetry. Another 
important goal for the LHC is the precise measurement of the hadronic production of gauge boson pairs, 
WW, WZ, ZZ, Wj, Z7, this in connection to the investigation of the non-Abelian gauge structure of 
the SM. W pair production, 

qq^W+W-, (41) 



Contributed by: G. Chachamis 
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plays an essential role as it serves as a signal process in the search for New Physics and also is the 
dominant irreducible background to the Higgs discovery channel pp ^ H ^ W*W* — )• Ivl'v' , in the 
intermediate Higgs mass range 1.199.1 . Both ATLAS and CMS collaborations have released first values 
for the WW cross section Il200ll20l1 . 

The process ^^ is currently known at next-to-leading order (NLO) accuracy Il2021l203[|204l 12051 



12061 11571 ?]. The NLO corrections were proven to be large enhancing the tree-level result by almost 
70% which falls to a (still) large 30% after imposing a jet veto. Therefore, if a theoretical estimate for 
the W pair production is to be compared against experimental measurements at the LHC, one is bound to 
go one order higher in the perturbative expansion, namely, to the next-to-next-to-leading order (NNLO). 
This would allow, in principle, an accuracy of around 10%. 

High accuracy for the W pair production is also needed when the process is studied as background 
to Higgs production in order to match accuracies between signal and background. The signal process 
for the Higgs discovery via gluon fusion, gg — )• H, as well as the process H — )• WW — )• IvVv' are 
known at NNLO ||2071 |208l |209l |210l IlII] EEl [HI IHl EB] HJH, whereas the EW corrections are 
known beyond NLO 112 1711 . Another process that needs to be included in the background is the W pair 
production in the loop induced gluon fusion channel, 

gg ^W+W- . (42) 

The latter contributes at 0{a1) relative to the quark-anti-quark-annihilation channel but is nevertheless 
enhanced due to the large gluon flux at the LHC 112 18112 191 

The first main difficulty in studying the NNLO QCD corrections for W pair production is the 
calculation of the two-loop virtual amplitude since it is a 2 — ;• 2 process with massive external particles. 
We have already computed the virtual corrections at the high energy limit 112201 l97ll22 111 . However, this 
is not enough as it cannot cover the kinematical region close to threshold. Therefore, in order to cover 
all kinematical regions we proceed as follows. We perform a deep expansion in the W mass around 
the high energy limit which in combination with the method of numerical integration of differential 
equations II222I 122311224 ] allows us the numerical computation of the two-loop amplitude with full mass 
dependence over the whole phase space. 

5.2 THE HIGH ENERGY LIMIT 

The methodology for obtaining the massive amplitude in the high energy limit, namely the limit where 
all the invariants are much larger than the W mass, is similar to the one followed in Refs. i 225l|226i| . The 
amplitude is reduced to an expression that only contains a small number of integrals (master integrals) 
with the help of the Laporta algorithm II227II . In the calculation for the two-loop amplitude there are 71 
master integrals. Next step is the construction, in a fully automatised way, of the Mellin-Bames (MB) 
representations II228II2291 of all the master integrals by using the MBrepresentation package [230|. The 
representations are then analytically continued in the number of space-time dimensions by means of the 
MB package II231II . thus revealing the full singularity structure. An asymptotic expansion in the mass 
parameter (W mass) is performed by closing contours and the integrals are finally resummed, either with 
the help of XSummer f2321 or the PSLQ algorithm II233I . The result is expressed in terms of harmonic 
poly logarithms. 

5.3 POWER CORRECTIONS AND NUMERICAL EVALUATION 

The high energy limit by itself is not enough, as was mentioned before. The next step, following the 
methods applied in Ref. [234], is to compute power corrections in the W mass. Power corrections are 
good enough to cover most of the phase space, apart from the region near threshold as well as the regions 
corresponding to small angle scattering. 
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We recapitulate here some of the notation of Ref. 1122 11 for completeness. The charged vector- 
boson production in the leading partonic scattering process corresponds to 

q{Pi) + q{P2) -^ W-{p3,m) + W+{p4,m), (43) 

where pi denote the quark and W momenta and m is the mass of the W boson. 

We have chosen to express the amplitude in terms of the kinematic variables x and m,^ which are 
defined to be 

^ '^^ 

X = --, nis = — , (44) 

s s 

where 

s = {pi+ Pt)^ and t = {pi - ps)^ - m^ . (45) 

The variation then of x within the range [1/2(1 — /?), 1/2(1 + (3)], where /3 = y^l — Am? / s is the 
velocity, corresponds to angular variation between the forward and backward scattering. 
It should be evident that any master integral Mi can be written then as 

I 
Mi = Mi {nis , x, e) = ^ e' Uj (m^ , x) , (46) 

j=k 

where e is the usual regulator in dimensional regularization (d = 4 — 2e) and the lowest power of e in the 
sum can be —4. 

The crucial point now is that the derivative of any Feynman integral with respect to any kinematical 
variable is again a Feynman integral with possibly higher powers of denominators or numerators which 
can also be reduced anew in terms of the initial set of master integrals. This means that one can construct 
a partially triangular system of differential equations in the mass, which can subsequently be solved in 
the form of a power series expansion, with the expansion parameter in our case being nis following the 
conventions above. 

Let us differentiate with respect to m^ and x, we will then have respectively 

d 



m. 



'dm, 

and 

dx 



-Mi{ms,x,e) = ^Cjj(ms,x,e) Mj{ms,x,e) (47) 



x—Mi{ms,x,e) = '^C'ij{ms,x,e) Mj{ms,x,e) . (48) 



We use Eq. ( 47 1 to obtain the mass corrections for the master integrals calculating the power series 
expansion up to order tti]^ (see also Ref. [234] for more details). This deep expansion in m^ should be 
sufficient for most of the phase space but still not enough to cover the whole allowed kinematical region. 
The way to proceed from this point is to numerically integrate the system of differential equations. 



In particular, we choose to work with the master integrals in the form of Eq. (46 1, where the e 



dependence is explicit. We can then work with the coefficients of the e terms and accordingly have 

ms- — li {ms , a;) = ^ J^ (m^ , x) Ij {m^ , x) (49) 

i 

and 

x—Ii{ms,x) = '^J^{ms,x) Ij{ms,x), (50) 

j 

where the Jacobian matrices J^^ and J^ have rational function elements. 
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By using this last system of differential equations, one can obtain a full numerical solution to the 
problem. What we are essentially dealing now with is an initial value problem and the main requirement 
is to have the initial conditions to proper accuracy. The initial conditions, namely the values of the 
master integrals at a proper kinematical point which we call initial point, are provided by the power 
series expansion. The initial point has to be chosen somewhere in the high energy limit region, where 
rus is small and therefore, the values obtained by the power series are very accurate. Starting from there, 
one can evolve to any other point of the phase space by numerically integrating the system of differential 



equations Eq. (49 1 and Eq. (50 1. 



We parametrise with a suitable grid of points the region close to threshold and then we calculate 
the master integrals for all points of the grid by evolving as described previously. Given that the master 
integrals have to be very smooth (we remain above all thresholds) one can use, after having the values 
for the grid points, interpolation to get the values at any point of the region. We use 1600 points for the 
grid and take as initial conditions the values of the master integrals at the point rUs = 5 x 10^'^, x = 1/4. 
The relative errors at that point were estimated not to exceed 10~^^. 

The numerical integration is performed by using one of the most advanced software packages 
implementing the variable coefficient multistep method (ODEPACK) [235|. We use quadruple precision 
to maximise accuracy. The values at any single grid point can be obtained in about 15 minutes in average 
(with a typical 2GHz Intel Core 2 Duo system) after compilation with the Intel Fortran compiler. The 
accuracy is around 10 digits for most of the points of the grid. It is also worth noting that in order to 
perform the numerical integration one needs to deform the contour in the complex plane away from the 
real axis. This is due to the fact that along the real axis there are spurious singularities. We use an elliptic 
contour and we achieve a better estimate of the final global error by calculating more than once for each 
point of the grid, using each time different eccentricities. Grids of solutions can actually be constructed, 
which will be subsequently interpolated when implemented as part of a Monte Carlo program. 

One very stringent test we use to cross-check the correctness and also the accuracy of our cal- 
culation is to compare the infrared pole structure of our two-loop result against the one predicted by 
Catani II182II (see also Refs. II236I 12371 12381 ). According to Catani, the infrared poles of the interference 
of the tree and the two-loop amplitudes follow a generic formula which in our case, since we work with 
the rescaled variables rus and x, can be cast into the following form: 

cSims,x, -) = 2Re{l«(ffl)(MW|M(i)) +l(2)(ffl)(MW|M(0))} , (51) 

where M^''^ and M^^-* are the tree level and one-loop amplitudes respectively and fi is the renormalization 
scale. The operators l(^)(fil) and l(^)(ffl) encode the information for the infrared pole structure and their 
exact expressions can be found in Ref. ll97l . 

The way to perform the test is straightforward. For each point of the grid with coordinates 
(m,5(j), X(j)), we compute the numerical value of the two-loop amplitude (M^^^) interfered with the tree 
level amplitude 

A^'>'^\m,^,),x„ -) = (m(0)|m(2)) + (m(2)|m(0)) (52) 

A*- 

by numerically integrating the differential equations as described previously and we also calculate the 



numerical value of the quantity Qia„j ('7is(i),a^(j), -) by using Eq. (51 1. Then, all we need to make 



sure is that the infrared singularities of the quantity I ^(°^^)(mg(j),Xi, ^) — Qta„j ("^s(j))^j) ^) f can- 
cel numerically for every point (?7i5(j), X(j)) of the grid (ultraviolet divergencies have been removed by 
renormalization). We will not present here any numbers since the aim was to describe the general meth- 
ods. The details and the results of the study will be presented in a future publication 112391 . 
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5.4 CONCLUSIONS 

W pair production via quark-anti-quark-annihilation is an important signal process in the search for New 
Physics as well as the dominant irreducible background for one of the main Higgs discovery channels: 
H — )• WW — )• 4 leptons. Therefore, the accurate knowledge of this process is essential for the LHC. 
After having calculated the two-loop and the one-loop-squared virtual QCD corrections to the W boson 
pair production in the high energy limit we proceed to the next step. Namely, we use a combination of a 
deep expansion in the W mass around the high energy limit and of numerical integration of differential 
equations to compute the two-loop amplitude with full mass dependence over the whole phase space. A 
strigent cross-check of our calculation is to verify that the infrared structure of our result agrees with the 
prediction of the Catani formalism for the infrared structure of QCD amplitudes. 
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6. COMPUTATION OF INTEGRATED SUBTRACTION TERMS NUMERICALLY 

Abstract 

We report on a numerical representation of the integrated subtraction terms of 
the NNLO subtraction scheme defined in Refs. 1,2401 12411 l242ll243]| . The in- 
tegrated approximate cross sections themselves can be written as products of 
insertion operators (in colour space) times the Bom, or the one-loop cross sec- 
tion. The insertion operator is constructed from the numerical representation 
of the integrated subtraction terms. We give selected results for the integrated 
doubly-coUinear subtraction term. 

6.1 INTRODUCTION 

We consider the NNLO correction to a generic rn,-jet observable. 



a 



NNLO 



m+2 



^^m+2-Jm+2 + 



m+l 



dcr^_l_x Jm+1 + / dcr^ Jm ■ 



(53) 



The three contributions on the right hand side are separately divergent in d = 4 dimensions, but their 
sum is finite for IR safe observables. To obtain the finite NNLO correction, we first continue analytically 



all integrals to d = 4 — 2e dimensions and then rewrite Eqn. (53 1 as 

NNLO f j^NNLO , f j^NNLO , f a^ 

J m+2 Jm+1 Jm 

that is a sum of three integrals where the integrands. 



1 NNLO 
^"m+2 



'm+2'^m+2 



1 ixix,i±2 J 
^^m+2 ^" 



da. 



RR.Ai 



Jri 



NNLO 
m 1 



24-2 <^m+l 



da 



RR,Ai 

m+2 



i 1 



(54) 



I , (55) 



Q"m+1 — 1 °-'^m+l + / '^^m+2 



RR,Ai 



J, 



and 

^ NNLO 



davv + 



, RR,A2 J 
d^m+2 -df^^ 



m+l 



RR,Ai2 
m+2 



, RV,Ai , ( f . RR,Ai\Ail J \ ^, 



+ 



1 RV,Ai , ( I A RR,Ai\Ai 
d<^m+l + / dCT„+2 



I Jm, (57) 



'^Contributed by: G. Somogyi, Z. Szor, Z. Trocsanyi 
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are integrable in four dimensions by construction. The approximate cross sections d(T^_,_'2 ^ and dcr^^'2 ^ 
regularise the doubly- and singly-unresolved limits of the real-emission contribution, dcr^^2 respec- 
tively. The double subtraction due to the overlap of these two terms is compensated by dcr^_,_'2 ^^. 
These terms are given explicitly in Ref . II242II . Finally, da^_^i ^ and ( /^ 'i^m+2 ^ ) ^ regularise the 
singly-unresolved limits of da^_^_i and J^ d(T^_,_'2 ^ respectively. They ai^e given explicitly in Ref. f 243 |. 

The construction of each approximate cross section in Eqns. (55 -|57 1 is based on the known and 
universal IR limits of tree level and one-loop squared matrix elements, and proceeds in two steps. First, 
the IR factorisation formulae are written in such a way that their complicated overlap structure can be 
disentangled ("matching of limits") II240[|244 I. Second, we define "extensions" of the formulae, so that 
they are unambiguously defined away from the strict IR limits 1124 1112421 12431 . These extensions are 
defined by the use of various momentum mappings that map a set of m + 1 or ttj + 2 momenta into a set 
of m momenta, 

{p}m+l > {P}m and {p}m+2 > {p}m , (58) 

such that (i) the delicate structure of cancellations among the matched limit formulae in various limits is 
respected (ii) exact momentum conservation is implemented, and (iii) the original m+1 or m + 2 particle 
phase space factorises exactly into the product of an m particle phase space and a one- or two-particle 
phase space measure, 

d(pm+ri{p}m+r;Q) = d(pm{{p}m] Q)[dpr,m] , r = 1,2. (59) 

To finish the definition of the scheme, one must compute once and for all the one- and two-particle 



integrals, denoted formally as f^ and f^, appearing in Eqns. (56-57). 

In general the integrated subtraction terms are integrals of extensions over the whole phase space 
of combinations of the QCD splitting functions and squared soft currents. In this proceedings we discuss 
two examples: (i) the singly-coUinear subtractions Q^' and (ii) the doubly-coUinear subtractions Q^,' j, 

RR A RR Ao J I 

which are part of d(T^_^'2 ^ and do-^_,_'2 in Eqn. (55 1, respectively. The precise definitions of these terms 



can be found in Ref. Ii242il . The meaning of the superscript is irrelevant for our present purpose (also 
explained in Ref. L2421 ). 

Denoting a generic subtraction term by X^^'^) (such as C^^ ) the integrated counterterms can be 
written in the following general form: 



■^(i,k) 



"'5 ' ^ 



27r '\Q 



r+e 



Nx{e)x('Hx, . . .)Re{M^^H{p})\T, • T, . . . \M^^H{p})) , (60) 



where S^ = (47r)^/r(l — e) , and X^^\x, . . .) represents a function that depends on kinematical invari- 
ants of the factorized ?7i-parton phase space. It results in the integration of the subtraction term Af(^'^) 



over the factorized phase spaces [dp^.m] in Eqn. (59 1. In a NNLO computation the possible cases are 



r + i + k = 1 with £ + fc = or 1, and r = 2 with £ + k = 0. We use the colour- and spin-state notation 
of Ref. II236L when the amplitude for a scattering process involving m final-state momenta, \A4m ), is 
an abstract vector in colour and spin space; k denotes the number of loops. Colour interactions at QCD 
vertices are represented by associating colour charges Tj with the emission of a gluon from each parton 
i. There are 2r such colour charges. Then the functions X^^'> are dimensionless in colour-space. For cer- 
tain subtraction terms, universal, possibly e-dependent numerical factors, Nx{e) appear naturally, which 
can be factored out. Our purpose is to compute all functions X^^\ which we discuss next. 

6.2 INTEGRATING THE COUNTERTERMS 

The actual computation of the integrated counterterms leads to a large number of multi-dimensional 
integrals. The ultimate goal is to find the analytical form of the coefficients of a Laurent expansion (in 
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e) of these integrals, which turns out to be a rather tedious job. In order to compute these coefficients as 
efficiently as possible, we have explored several methods. 

First, it is possible to extend the method of integration-by-parts identities and solving of differential 
equations, developed for computing multi-loop Feynman integrals l|245[|246i] , to the relevant phase space 
integrations II247II . This method yields e-expansions with fully analytical coefficients, with the final 
results being expressed in terms of two-dimensional harmonic polylogarithms (after a suitable basis 
extension, see Ref. [i247il for details). This approach was used successfully to compute a class of singly- 
unresolved integrals | I247| . 

Second, the phase space integrals that arise can be computed via the method of Mellin-Barnes 
(MB) representations II228I 12291 12481 . Here we obtain the e-expansion coefficients in terms of complex 
contour integrals over F-functions. Performing these integrals by the use of the residue theorem, a 
representation in terms of harmonic sums is obtained. In many cases, the sums can be evaluated in a 
closed form, yielding an analytical result. In some instances however, we find multi-dimensional MB 
integrals that are very difficult to compute fully analytically. Nevertheless, in these situations a direct 
numerical evaluation of the appropriate MB representations provides a fast and reliable way to obtain 
final results with small numerical uncertainties. We stress that for phenomenological applications, this 
is all that is required, since the numerical uncertainty of the complete computation is dominated by the 
phase space integrations. We have used the MB method to compute all singly-unresolved integrals II249L 
and all two-particle integrals appearing in J^ d(T^_,_2 ^^ as well JOTl . 

Finally, the method of iterated sector decomposition II250I can also be used to calculate the inte- 
grals we encounter [.251,1 . Sector decomposition produces a representation of the e-expansion where the 
coefficients are given in terms of (mostly quite cumbersome) finite integrals over the unit hypercube. The 
analytical evaluation of these integrals is not feasible except for the simplest cases. Nevertheless, this 
method is simple to implement and can be automated to a large extent. In fact there are several computer 
programs that use various implementations of sector decomposition to provide numerical values of coef- 
ficients of the powers of e in the Laurent expansion of dimensionally regulated integrals II252I 12531 I254II . 
We found the program SecDec powerful and flexible to generate sufficiently precise values of our inte- 
grated subtraction terms. 

Choosing the Cuhre integrator implemented in SecDec, we can easily reach 10^^ relative preci- 
sion for the integration. Such precision is sufficient for our purposes: (i) to demonstrate the cancellation 



of the e poles numerically, and (ii) to compute the finite integrals in Eqns. ( 56 1 and ( 57 1. As the numeri- 
cal uncertainty of the second item is limited more by the Monte Carlo integration over the m + 1 and m 
particle phase spaces, for item (ii) much lower (not better than 10^'^) precision is sufficient. This looser 
requirement on the precision for the 0(1) terms and the fact that the integrated subtraction terms are 
smooth functions of their parameters, with logarithmic behaviour for asymptotically small values of the 
parameters, makes possible that we find sufficient approximations to the integrated subtraction terms. 

6.3 APPROXIMATE INTEGRATED SUBTRACTION TERMS 

The computation of the integrated subtraction terms at any given values of the kinematical parameters, as 
required in the Monte Carlo integration over the phase space, is not feasible. In order to demonstrate the 
cancellation of the e poles numerically we can choose several randomly selected phase space points and 
evaluate the necessary integrals with high precision. The cancellation cannot depend on the particular 
phase space point. In the case of the finite remainders, in order to compute the phase space integrals in 



Eqns. (56 1 and ( pTj ), we are able to find sufficiently precise approximations to the integrated subtraction 
terms using a procedure that can be automated to high degree. The latter point is also important as there 
are several hundred integrals to compute. In the following, we outline our procedure for two cases: (i) an 
example with integrals depending on one kinematical parameter, (J^ ^(^'^) = J^ Q^' -') and (ii) another 
example with integrals depending on two kinematical parameters, {J^ X^^'''^ = J^ C, 



(0,0), 
ir,js . 
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.(^,0) 



In order to compute f-^ Q^' , we have to integrate the azimuthally averaged AltareUi-Parisi spUt- 



ting functions Pr ! (zj^, z^^j; e) in 4 — 2e dimensions for the sphtting process /j^ — )• /j + Z^, with Zj being 



the momentum fraction of parton /j. It was discussed in Ref. II249I that the corresponding functions C\J 
can be expressed as combinations of the integrals (we changed the notation from T to Xc) 

Ic{x;e,ao,do,K,k,6,g} >) = —-—Q^^ [dp^m+il ^1:^:^9} '{zr)f{ao,air,d{m,e)). (61) 

In terms of explicit integration variables these coUinear integrals have the general form II249I 

Icix; e, ao, do; k, k, 5, ^f ^) = x da 0''^-^^+^^' (1 - af'^''-^ [a + {I - a)x]-^-^'^+''^' 

Jo 

X r\v[v (1 - .)]- f " + ^^ - «)x. y-^^^ i±) ^ « + (1 - a)xv \ 
Jo \2a + {l-a)xJ ^^ \2a + {l-a)xJ 

The necessary functions g\ are listed in Ref. 112491 . where analytic results of these integrals for ao = 1 
and do = 3 are also presented. 

Our present goal is to provide sufficiently precise numerical approximations to the functions Ic{x) 
in a simple way. The motivation is that often it is difficult to perform the analytic computation with 
arbitrary values of the parameters. For instance, the derivation with ao = I is rather different from a 
derivation with ao < 1. Also, the choice for do is to some extent arbitrary, and a new choice requires 
a completely new analytic computation. Thus, for the sake of flexibility we propose a fully numerical 
approach here. 

First we used the program SecDec, modified such that it can compute the value of the integral at 
multiple values of the parameter x in a single run. For simplicity, we call the 0(1) terms of the integral 
'measurements'. Then, inspired by the analytic results in Ref. II249L we fitted these measurements by 
combinations of logarithms and polynomials in x of the form 



Tc{x;K = 0,k,6 = 0,g^^^ = l)=^PJr\x,k)log^x), Pi-)(x, A;) = £ a(f)x" (63) 



where the upper limit rimax is determined by the power — rimax of the leading pole in the Laurent- 
expansion (in e) of the integral. As for the degree of the polynomials we tried several simple choices 
(m = 1, 2, 3). We found that splitting the region of the parameter space into an asymptotic (0 < x < 
10~^) and a non-asymptotic (10~^ < x < I) region, we could provide a fit with m = 2 that approximates 
the analytic result within relative difference few times 10~^. The loss of relative precision is associated 
with phase space points where the function changes sign, and its numerical value is close to zero (around 
X = 0.2). 

In Fig.lslwe show the approximate function Fc{x] ooi do^ 0, —1, 0, 1) together with the 'measure- 
ments', which coincide with the known exact analytic result to at least six digit accuracy. We find very 
good agreement, which is characterized by the ratio of the two values in the lower panels. In Fig. [Sf? 
we show the approximate function for ao = 0.1 and do = 3 — 3e together with the corresponding 
'measurements'. In this case the analytic results are not available. 

Building on the experience gained in studying the one-parameter case, we worked out a similar 
strategy for the integrated subtraction term /gC^y' j. The corresponding functions Q^'^ can be ex- 
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Fig. 8: The fitted function Tc[x\ 0, —1, 0, 1) compared to tlie integral X(x; 0, —1, 0, 1) at a) ao = 1 and 
do = 3, b) ao = 0.1 and do = 3 — 3e. 



pressed as combination of the integrals 

l2cixi,Xj]e,ao,do;k,l) = XiXj da d/3 6'(ao - a - /?) 











X (1 - a - /3)2'^''-2(i-^)a-i-^/3-i-^ (a + (1 - a - /3)xi)"^"' (/3 + (1 - a - /3)xj)" 



■l-e 



Jo Jo \2a + il-a-l3)xiJ \2/3 + {1 - a - /3)xj J 

(64) 

Weagainrun SecDec withao = 0. land do = 3— Seat several hundred different values of the kinematic 
parameters to obtain the 'measurements'. To reach 10^^ relative precision for all such 'measurements' 
takes several hours on a single CPU. Then we fitted these 'measurements' with the function 

T2c{xi,xf,k,l)=Y^ ^ P^^)(x„A:,OPi;")(x,,A;,Olog"'(x.)log"^(x,). (65) 

ni=0 nj=0 

We divide the parameter space < Xj, Xj < 1 into four regions: (i) < Xi, Xj < 10~^, (ii) < Xi < 
10-2 and 10-"^ < Xj < 1, (iii) < Xj < 10"^ and 10"^ < Xi < I, (iv) 10"^ < Xi,Xj < 1. Using 
m = 2, we are able to fit the original function I2C to per mille precision almost everywhere. The ratio 
of the fitted function T2C to the numerical evaluation of X2C is shown in Fig. |9] together whith the fitted 
function T2C itself. 

CONCLUSIONS 

We have worked out a numerical procedure for providing simple approximations of the integrated sub- 
traction terms of the NNLO subtraction scheme defined in Refs. Il2401l24lll242[|243l . We use the publicly 
available program SecDec to compute the coefficients of the Laurent expansion of the necessary inte- 
grals to high numerical precision. We found that the integrals that depend on one or two kinematical 
invariants can be approximated with simple combinations of polynomials and logarithms. The precision 
of these approximations is usually at per mille or better. 
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Part III 

PARTON DISTRIBUTION FUNCTIONS 

7, WHICH EXPERIMENTS CONSTRAIN THE GLUON PDF IN A GLOBAL QCD FIT? f\ 

Abstract 

Based on computation of PDF-induced correlations, we identify the experi- 
ments in CTEQ and MSTW global QCD analyses that are sensitive to the 
gluon parton density in the proton. The Tevatron inclusive jet production at 
large momentum fractions x and DIS charm quark production at moderately 
small X show the strongest correlation with the gluon PDF The strength of the 
PDF-induced correlation between the gluon PDF and inclusive (di)jet produc- 
tion data is different in the CTEQ and MSTW analyses. 

7.1 Introduction 

The parton distribution function (PDF) of gluons in a proton, g{x, ^), plays an important role in hadron 
collider phenomenology. It arises in cross sections for production of hadronic final states, massive scalar 

'Contributed by: Z. Liang, P. M. Nadolsky 
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Fig. 10: CTIO and CTEQ 6.6 PDF uncertainty bands at /i = 2 GeV (left) and 100 GeV (right), taken 
from Ref. II255II . The CTEQ 6.6 best-fit PDFs and uncertainties are indicated by solid curves and hatched 
bands, while those of CTIO are indicated by dashed curves and dotted bands. 



bosons, and hypothetical elementary particles, often in a combination with an overall normalization 
prefactor proportional to a^. The gluon distributions from CTIO II255II and CTEQ 6.6 II256I PDF sets are 



shown in Fig 10 The figure shows that the gluon PDF is constrained well by fitted experiments at the 
intermediate momentum fractions x, but the uncertainty grows in the region x > 0.1. We may ask which 
experiments in the global fit impose the most significant constraints on the the gluon PDF. It is often 
said that the precise neutral-current DIS data provides the tightest constraints on the gluon PDF at x of 
order 10~^, while inclusive jet production at the Tevatron plays the key role in constraining the gluon at 
X > 0.1. However, the net PDF uncertainty reflects subtle interplay of numerous constraints imposed 
by QCD theory and multiple experiments, as well as various correlated uncertainties in experimental 
measurements. In this contribution, we identify the experiments with the strongest sensitivity to the 
gluon PDF by using a method of PDF-induced correlations that was developed in Refs. Il256[ 12571 12581 . 
The analysis of correlations provides a systematic way to identify such experiments and also to establish 
specific ranges of x and Q where the correlations of the experimental data sets with the gluon PDF are 
the most pronounced. 



7.2 Log-likelihood x^ ^nd PDF-induced correlations 

The quality of theory description of an experimental data set can be quantified by the log-likelihood 
function x^- Many high-energy physics experiments publish three kinds of measurement errors for each 
data point i: the statistical error Cj, uncorrected systematic error Uj, and correlated systematic errors 
{/3ij, /32i, P3i....(3Ki} of K different types. To compare a theory prediction Tj to the data value Di for a 
data point i, while accounting for all types of errors, the x^ function can be constructed as 112591 1260]| 
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(66) 



where a? = af + uf is the combined uncorrelated error; r^ are random parameters describing each of 
K correlated errors (each distributed according to the standard normal distribution); A'^e is the number of 
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the data points; and K is the number of the sources of the correlated systematic errors. 



Analytic minimization of the function ( 66 1 with respect to the correlated systematic parameters r^ 
renders the following result 112571 I259II : 



where A^k' and Bk are given by 

-^kk' = ^kk' 
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(67) 
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(68) 



Substituting Eq. (67 1 into Eq. (66 1, we obtain a reduced y^ function II257[ 12591 . 
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(69) 



In this function, the information about the systematic shifts in r^ is included implicitly. Often, the 
influence of the correlated shifts on the PDFs is substantial. 

Next, we wish to discuss correlations between PDF uncertainties of two variables, X{a) and Y{a), 
where a = {ai, 02, ..., aAr} is the vector of N PDF parameters. The correlations can be computed either 
in the Hessian II2561I2571 1258.1 or Monte-Carlo 1.261,1 approaches. In this note we will adopt the Hessian 
approach. 

A symmetric PDF uncertainty AX corresponds to the maximal variation of X for all combinations 
of PDF parameters that lie within the tolerance hypersphere Ax^ < T^- This uncertainty is given by 



AX 
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N 



Ei^t 



XT 



(70) 



i=l 



in terms of the value Xq of X obtained with the central PDF set, and values Xf and X~ of X ob- 
tained for maximal positive and negative displacements of each orthonormal PDF parameter Oj within 
the tolerance hypersphere. The same "master equation" defines AY, the PDF uncertainty of the variable 
Y. 

In the linear approximation, the pairs of values of X and Y that are allowed within the PDF 
uncertainty correspond to the points inside an ellipse in the X-Y plane. The boundary of the ellipse is 
parametrically described by 



X = XQ + AXcose, 

Y = Yq + AY cos{e + ^), 



(71) 
(72) 



where the parameter 9 varies between and 27r, and the relative phase angle (/? is a function of Xf and 
y^ . The PDF uncertainties AX and AY are calculated according to Eq. (70 1. The angle ip is included 



between the gradients VX and VY of X and Y in the PDF parameter space. Its cosine. 



cose/? 



VX-VY 
AXAY 



1 



N 

Iaxay^ 



(+) 



X 



(-) 



y. 



(+) 



Y 



(-) 



(73) 



quantifies the degree of similarity in the PDF dependence of X and Y. If X and Y are strongly correlated 
(corresponding to cos y? — )• 1) or anti-correlated (cos ip ^ —I), the PDF uncertainties of X and Y are 
driven by essentially the same combinations of PDF parameters. Conversely, the PDF dependence of X 
is independent from the PDF dependence of Y if cos if ^ 0. 
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7.3 Which experiments are sensitive to the gluon PDF? 

If an experimental cross section a strongly constrains a PDF fa{x, Q) for some combination of x and Q, 



we expect that Eq. (73 1 returns | cos ip\ close to unity when using X = fa/Ai^i Q) ^nd Y = a. \i the 
experimental data set includes several data points, we can use Y = x^- The strength of the constraint 
on the PDF from this experiment is determined by | cos Lp\ and the magnitude of x^- In the majority of 
the fitted experiments, x^ /Nf. is close to 1, so that | cos (^| tends to be more important for distinguishing 
between the sensitivities of the experiments than the magnitude of x^. 

Following this approach, we compute cos ^p between the NLO gluon PDF g{x, Q) in various x 
ranges (for Q^ = 10 GeV^), and x^ for typical experimental data sets that are used in the PDF analysis. 
In this study, we compute cos c/? for the experiments from the CTIO analysis that are listed in Table|4] In 
the figures, we refer to each experiment by its numerical ID that is shown in the left column of Table |4] 

The cos If values between the gluon PDF at a given x value and x^ for each experiment are 
plotted as two-dimensional contour plots for CTIO NLO PDFs [255] in the left panel of Fig.[TT] and for 
MSTW'08 NLO PDFs II262II in the right panel. The horizontal axis indicates the range of x in g{x, Q). 
The vertical axis indicates the ID of the experiment. At the bottom of the figure, we show the color legend 
adopted to draw the contour plots. The color legend is chosen so as to emphasize only cells with large 
correlation (cos 99 > 0.5, dark yellow-red colors) or large anticorrelation (cos 99 < —0.5, blue colors). 
The regions with | cos 99 1 < 0.5 are filled with a light-yellow color. The x? values for each data set are 



computed according to Eqs. ( 66 1 and ( 69 1 using the CTEQ fitting code for both CTIO and MSTW PDF 
sets. 



Visual inspection of two panels of Fig. 1 1 reveals both similarities and differences in the pattern 
of correlations of the gluon PDF in the CTIO and MSTW PDF sets. In the case of the CTIO PDF (left 
panel), the gluon PDF has a pronounced anti-correlation (blue spots) with HERA charm and bottom 
SIDIS production data sets (experiments 140, 143, 145, 156, 157) at x < 0.1, as well as with Tevatron 
inclusive jet production data sets (experiments 504, 505, 514, and 515) at x > 0.05. Some correlations 
(brown and red spots) are also observed, but they are not as pronounced as the anti-correlations. Weaker 
(anti-)correlations can be noticed with the NMC F2, CDHSW Fl', and E605 pp Drell-Yan process data, 
corresponding to experiments 103, 108, and 201. 



While the gluon PDF of the MSTW'08 set (right panel of Fig. 1 1 1 also shows an (anti-)correlation 
with the heavy-quark DIS and jet production data, the overall pattern of the correlations is somewhat dif- 
ferent from the CTIO case. Here, the gluon PDF is mostly correlated with high-x jet production (experi- 
ments 504, 505, 514, and 515), while it is either correlated or anti-correlated with heavy-quark DIS ex- 
periments (experiments 140, 143, 145, 156, 157). In addition, we observe significant (anti-)correlations 
with the combined HERA DIS data set (ID=159) and fixed-target DIS experiments (ID=101-124) that 
are not seen in the CTIO panel. 

We now turn to the correlations of the gluon and u-quark PDFs with x^ values in individual bins of 
Tevatron inclusive jet and dijet production data. For this purpose, we represent x^ for one experimental 



data set in Eq. ( 69 1 as a sum of contributions xt from individual data points i: 



x' 



Y.^1 (74) 



i=l 



where 

Ne. [ ^ ^ K 



Xi 



2 _ Di — Ti Y^ J Dj — Tj Y^ ^ki .-1 Pk'^ 



E^«-i^^E^-*^.^ ■ (^« 



* i=i I ^ k,k'=i 



Each contribution xf accounts for the effect of correlated systematic shifts through the term that includes 
^kk' '^^ ^^^ right-hand side of Eq. (75 1. Again, the constraining power of each point is determined both 
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Fig. 11: Correlation between the gluon distribution from CTIO NLO (left) and MSTW2008 NLO (right) 
PDF sets and x^ for the experiments used in the CTIO global QCD analysis. The color of each cell 
indicates the value of cos ip according to the included legend. The ID's of individual experiments are 
listed in Table |4l 
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ID 


Experimental data set 


159 


Combined HERAl NC and CC DIS 112631 


101 


BCDMS F^ 112641 


102 


BCDMS Fi 12651 


103 


NMC F^ 112661 


104 


NMC F!^/FI^ 12661 


108 


CDHSW F^ 112671 


109 


CDHSW F^ 1267] 


110 


CCFR F^ 12681 


111 


CCFR xFf |269j 


124 


NuTeV neutrino dimuon SIDIS 112701 


125 


NuTeV antineutrino dimuon SIDIS II270I1 


126 


CCFR neutrino dimuon SIDIS P27JJ 


127 


CCFR antineutrino dimuon SIDIS 112711 


140 


HI F| 112721 


143 


HI < force 1273112741 


145 


HI a^ for 65 1273112741 


156 


ZEUS F!i 12751 


157 


ZEUS F^ 12761 


201 


E605 Drell-Yan process, a{pA) II277B 


203 


E866 Drell Yan process, a[pd)l{2a{pp)) 12781 


204 


E866 Drell-Yan process, aipp) 12791 


225 


CDF Run-1 W charge asymmetry 12801 


227 


CDF Run-2 W charge asymmetry [28 IJ 


231-234 


D0 Run-2 W charge asymmetry [282] 


260 


D0 Run-2 Z rapidity distribution [283| 


261 


CDF Run-2 Z rapidity distribution 12841 


504 


CDF Run-2 inclusive jet production 12851 


505 


CDF Run-1 inclusive central jet production 12861 


514 


D0 Run-2 inclusive jet production 12871 


515 


D0 Run-1 inclusive jet production 12881 



Table 4: Experimental data sets examined in this analysis. 
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CT10, g(x,Q=3.163GeV) 



MSTW08NLO,g(x,Q=3.163 GeV) 




CT10, u(x,Q=3.163GeV) 



MSTW08NLO,u(x,Q = 3.163 GeV) 



Q 
CO 



101^ 

106 

111 

116 

121 

203 

208 

213 

218 

301 

306 

311 

316 

401 

406 

411 

416 

504 

509 

514 

604 

609 






10 = 10 < 10 =i 0.01 0.05 0.1 0.2 0.4 0.7 

X 



nil 












106 






















111 


























11H 






































■^"^i^^^^^^i 


■1 




^^^^^^^^^^ 


208 
213 




















306 
311 






s 
































































411 




























509 














h14 














6114 
















609 


- - - ' - 



10 < 10 ' 0.01 0.05 0.1 0.2 0.4 0.7 

X 



Fig. 12: Correlation cosine between xf in each pT bin from D0 Run-2 inclusive jet production and 
gluon and u quark distributions from CTIO and MSTW 2008 NLO sets. The horizontal axis refers to the 
X value in the PDF. The vertical axis indicates the numerical ID of the experimental bin for which x^ is 
computed. The ID for each bin is indicated as 100 iy + ip^, where iy = I 
corresponding rapidity interval and the pT interval, respectively. 



..6 and ip^ are the ID's of the 
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Fig. 13: Correlation cosine between xf in each rujj bin from D0 Run-2 dijet production and gluon and 
u quark distributions from CTIO and MSTW 2008 NLO sets. The horizontal axis refers to the x value in 
the PDF. The vertical axis indicates the numerical ID of the experimental bin for which x^ is computed. 



The ID for each bin is indicated as 100 i 
corresponding intervals in y, 



+ ir 



. and rrijj, respectively. 



where i 



Vn 



1, .., 6 and Zm,, are the ID's of the 
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by the value of | cos c^| and the magnitude of xf, with the latter being comparable to unity for the majority 
of the data points. 

For D0 Run-2 single-inclusive jet cross sections f287'|, we plot the cos (p values for the gluon and 
ti-quark PDFs, with xf computed for each bin of the jet's transverse momentum px and rapidity y. The 



resulting contour plots are shown in Fig. 12 Similarly, for D0 Run-2 dijet cross sections II289II . Fig. 13 
shows the contour plots of cos if for xf in the bins of of dijet invariant mass ttijj and maximal absolute 
rapidity |y| = max(|yi|, \y2\) of the dijets. In both figures, theory cross sections are computed at NLO 
(without threshold resummation corrections) with the FASTNLO code l|290[ 12911. using the settings 



described in Section 13. The same color legend as in Fig. 11 is used. Similar patterns of correlations 
were found with the CDF Run-2 inclusive jet data (not shown). 

The upper panels in both figures show cos tp for CTIO NLO and MSTW'08 NLO gluon PDFs. The 
correlated experimental errors modify the correlations by smearing the cos ip distribution. The pattern 
of cos(/3 indicates clearly that the (di)jet data are very sensitive to the gluon at x above 0.01. However, 
the correlation is weaker for the CTIO gluon PDF (left panel) then for MSTW'08 PDF (right panel), 
suggesting that the importance of the constraints on the gluon PDF from the jet data is not the same in 
two fits. In addition, the MSTW'08 n-quark PDF shows mild (anti-)correlation with both single-inclusive 



jet data and dijet data, as can be observed in the right lower panels in Figs. 12 and 13 No pronounced 
(anti-)correlations with the u-quark PDF or other quark PDFs of physical flavors are observed for the 
CTIO set, shown in the lower left panels. 

The contour plots confirm the expectation that the inclusive jet data play an important role in 
constraining the gluon PDF. While the constraints are strongest at x > 0.1, they extend down to x as low 



as 0.05 for both CTIO and MSTW sets, as can be observed in Figs. 12 and 13 The gluon PDF is sensitive 
to constraints from heavy-quark semi-inclusive DIS production at even lower x values, cf. Fig. 1 1 As the 
HERA data on heavy-quark DIS production continue to improve, it will play an increasingly important 
role in constraining the low-x gluon density. 

While the patterns of PDF-induced correlations are visually similar for the CTIO and MSTW'08 
sets, they are not completely identical. Constraints on the gluon PDF from Tevatron jet production may 



not be as strong in the CTIO fit as in the MSTW'08 fit, according to Figs. [12] and [13] It remains to 
be investigated what causes the observed differences between CTIO and MSTW sets in the correlations 
involving the gluon PDF. Several features are different in these fits, including different heavy-quark DIS 
schemes, choice of experimental data sets, PDF parametrizations, and radiative contributions in theoret- 
ical cross sections. A combination of these effects may indirectly affect the strength of the constraints 
imposed on the gluon density by the collider jet data. 
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8. PDF CONSTRAINTS FROM ELECTROWEAK VECTOR BOSON PRODUCTION AT THE 

LHCB 

Abstract 

We present a study of the impact of the recent W and Z measurements from 
ATLAS, CMS and LHCb on parton distribution functions. We show that the 
NNPDF2. 1 NNLO predictions are consistent with all the new data, but that 
these provide significant further constraints on the light quarks and antiquarks 
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at medium and small-x. We conclude that these data already have the potential 
to play a useful role in future global PDF analyses. 

8.1 LHC measurements sensitive to PDFs 

The LHC has already provided an impressive set of measurements which are sensitive to parton distribu- 
tions: inclusive jet and dijet data 112921 12931 I294L electroweak vector boson production 112951 12961 12971 
12981 1299, 300] (both inclusive and in association with heavy quarks BOlll ) and direct photon produc- 
tion |[ 302ii303 l. The purpose of this contribution is to quantify the impact on PDFs of a subset of these 
data, the W and Z inclusive production measurements. In this first section we will review the status of 
LHC data relevant for PDF determination and then in the next section we will study how the W, Z data 
impact on the NNPDF analysis. 

Let's begin this short review of LHC data with electroweak vector boson production. ATLAS has 
measured the W lepton and Z rapidity distributions using the 2010 data (36 pb^^) and determined the 
full covariance matrix of correlated experimental uncertainties II295II . This measurement supersedes the 



original muon asymmetry measurement from W decays 1129611 . for which the covariance matrix was not 
available. The CMS collaboration has presented a preliminary measurement of the muon asymmetry 
with 2011 data (234 pb~^) 1.297,1 which supersedes the 2010 data 1,298.1 . In addition it has presented a 
measurement of the normalized Z rapidity distribution using 2010 data 112991 . In neither of these two 
measurements has the full covariance matrix been made available. Finally, the LHCb Collaboration has 
presented preliminary results for the Z rapidity distribution, W lepton asymmetry and W lepton charge 
ratio using 2010 data li300J . 



Data Set 


Ref. 


iVdat 


[^/miiii ^inaxj 


(astat) (%) 


{<Tey.) (%) 


(cTnorm) (%) 


ATLAS W,Z 36 pb"' 

ATLAS ■(¥+ 36 pb^^ 

ATLAS W^" 36 pb^^ 

ATLAS Z 36 pb^^ 


12951 


30 
11 
11 
8 


[0,3.2] 
[0,2.4] 
[0,2.4] 
[0,3.2] 


1.9 
1.4 
1.6 
2.8 


1.7 
1.3 
1.4 

2.4 


3.4 

3.4 
3.4 
3.4 


CMS ^^ rapidity 36 pb"' 
CMS muon asymmetry 234 pb^^ 


1299] 
1297] 


35 
11 


[0,3.6] 

[0,2.4] 


12.3 
1.7 


3.1 






LHCb ^rapidity 36 pb^' 
LHCb W lepton asymmetry 36 pb^^ 


1300] 
13001 


5 
5 


[2,4.5] 
[2,4.5] 


20 
16 


5 
21 


3.4 




Table 5 : The number of data points, kinematical coverage and average statistical, systematic and normalization 
percentage uncertainties for each of the experimental LHC W and Z datasets considered in the present analysis. 
For the CMS Z rapidity data, the systematic uncertainty is included in the statistical uncertainty: there is no 
normalization uncertainty because these data are normalised to the total cross-section. 

The kinematical coverage of each of the various LHC W and Z dataset with the corresponding 
average experimental uncertainties for each dataset are summarized in Table [5] As we can see the LHC 
electroweak data span a large range in rapidity up to t] =4.5. Each of the three processes considered, 
W~^, W~ and Z is sensitive to different partonic subprocesses. 

There are other LHC datasets potentially sensitive to PDFs. Jet production from the Tevatron 
has been a very important measurement not only to constrain the gluon at high x, but in determining 
the strong coupling from a global PDF analysis B 041 1305.1 . Similar constraints are expected from the 
LHC jet data, extended into a wider kinematical range. From the 2010 (36 pb^^) dataset inclusive jet 
and dijet production has been measured by both CMS 112921 129311 and ATLAS |294], however only for 
ATLAS is the full experimental covariance matrix available. The LHC inclusive jet data can be treated 
within a global analysis framework using tools like FastNLO or APPLgrid |306|. Since the full NNLO 
corrections to the inclusive jet production are unknown, jet data in a NNLO analysis can be included 
only within some approximation: for example with NNLO PDF evolution and coupling running but with 
NLO matrix elements, or else with NLO matrix elements supplemented with Sudakov estimates of the 
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NNLO corrections. Another LHC measurement that has the potential to constrain the gluon PDFs is 
prompt photon production from ATLAS [302] and CMS [303|: its consistency with NLO QCD and their 
impact on the NNPDF2.1 PDFs will be discussed in detail in Ref. [.3071 . 

8.2 PDF constraints from LHC W and Z 

Until recently all available NNPDF sets ^M [SDH HH ^M EIIl EH El were based on non-LHC 
data. NNPDF2.2 113141 was the first set to include LHC data, the W lepton asymmetry from ATLAS 
and CMS [[2981 [2961 . However now these two datasets are outdated, the first because now the full 
correlation matrix of the W and Z lepton distributions is available, and the second because data from 
higher luminosities is also available. So we have chosen to continue to use as our baseline the NNPDF2. 1 
NNLO set. 

We now study the impact of the latest LHC W and Z data on the NNPDF parton distributions. All 
our theoretical NNLO predictions will be computed with DYNNLO [315] with the same cuts and settings 
as in the respective measurements. The impact of the new data will be quantified using the reweighting 
method of Refs. ll3T6ll3T4ll applied to the N^^p = 1000 repUcas of the NNPDF2. 1 NNLO set. 

To begin with, we have computed the x^ for each of the datasets in Table [s] for the most re- 



cent NNLO PDF sets currently available on LHAPDF: NNPDF2.1, MSTW08 [;262il . ABKM09 II3T7I . 
HERAPDF1.5 [318] and JR09 [319|. When available, we use the full experimental covariance matrix. 
Normalization uncertainties are included using the to method [320]. This is important specially for the 
treatment of the ATLAS differential distributions where normalization uncertainties are comparable to 
the statistical and systematic uncertainties (See Table [5]). 

The results are summarized in Table |6] For the ATLAS W and Z lepton distributions we show 
the results both for the total dataset and the individual subsets, where in the latter case cross-correlations 
between subsets have been neglected. In all cases the theoretical NNLO predictions have been obtained 
with DYNNLO as discussed above. We can see that none of these PDF sets describes the ATLAS and 
CMS data perfectly, although NNPDF2.1 and HERAPDF1.5 give probably the best description, while 
ABKM09 and JR09 are significantly worse. All five sets give a reasonable description of the LHCb data 
within their large uncertainties. 



Dataset 


X^NNPDF2.1 


X' MSTW08 


X^ ABKM09 


X^ JR09 


X' HERAPDF1.5 


ATLAS 


2.7 


3.6 


3.6 


5.0 


2.0 


ATLAS W^+ 36 pb^" 

ATLAS W"" 36 pb^^ 

ATLAS ^ 36 pb^^ 


5.7 
2.5 
1.8 


6.5 
4.1 

3.7 


11.4 
5.4 
4.2 


5.4 
8.0 
6.5 


5.3 
6.4 
2.9 


CMS 


2.0 


3.0 


2.8 


3.6 


2.8 


CMS Z rapidity 36 pb"^ 
CMS muon asymmetry 234 pb^^ 


1.9 

2.0 


2.9 
3.4 


2.7 
3.0 


2.0 
8.7 


3.0 

2.1 


LHCb 


0.8 


0.7 


1.2 


0.4 


0.6 


LHCb Z rapidity 36 pb^' 
LHCb W lepton asymmetry 36 pb^^ 


1.1 
0.5 


0.7 
0.6 


0.8 
1.6 


0.6 

0.2 


0.8 
0.5 



Table 6: Comparison between LHC W and Z data and the most recent NNLO PDFs. For each PDF set we provide 
the x^ /dof between data and theory predictions, computed using the tg-method. 

For the ATLAS data, we would Irice to emphasize the importance of properly taking into account 
the correlations between datasets, specially the normalization: the description of the individual W^ , 
W~ and Z datasets is always worse than the overall description because of these cross-correlations. 
For the CMS Z rapidity distribution we find that the fixed order NNLO description seems rather worse 
than the NLO-i-LL prediction implemented in POWHEG [,299,1 : the origin of this difference should be 
investigated in future studies. 
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We now discuss the impact of these LHC EW data into the NNPDF2.1 NNLO PDFs ||3T3]| . In 
Table p] we summarize the initial x^ for each dataset, the x^ after reweighting, x?w- We find excellent 
agreement with all the LHC electroweak measurements after reweighting. Some comparisons between 
data and theory for a selected observables are shown in Fig. [14) From top to bottom we show the 



comparison with ATLAS, CMS and LHCb data. In each case we have included all the most updated 
electroweak datasets from each collaboration. 

In Table [7] we also show the effective number of replicas left after the reweighting, defined as in 



Ref. B161 using the Shannon entropy. 



A^r, 



N, 



eff = exp{^ Y^ Wk \n{Nrcp/wk)} 



(76) 



k=l 



In each case we have performed the reweighting separately for each of the experimental datasets individ- 
ually, for the combined datasets from each experiment, and finally with all three combined together. 



Dataset 


x' 


Arw 


NeS 


ATLAS 


2.7 


1.2 


16 


ATLAS W+ 36 pb"^ 

ATLAS yy- 36 pb-i 

ATLAS Z 36 pb-i 


5.7 
2.5 
L8 


1.5 
1.0 

1.1 


17 
205 
581 


CMS 


2.0 


1.2 


56 


CMS Z rapidity 36 pb-i 
CMS muon asymmetry 234 pb^^ 


L9 

2.0 


1.4 
0.4 


223 
200 


LHCb 


0.8 


0.8 


972 


LHCb Z rapidity 36 pb"^ 
LHCb W lepton asymmetiy 36 pb^^ 


LI 
0.8 


1.0 
0.5 


962 
961 


All data combined 


2.1 


1.2 


4 



Table 7: The impact of LHC electroweak measurements on the NNPDF2.1 NNLO PDFs. For each dataset we 
show the initial x^, the x^ after reweighting these particular dataset and the effective number of replicas Ncs in 
this case. We show both the results for individual datasets as well as for the combined impact of all datasets within 
the same experiment. All the results have been computed starting with iVrcp = 1000 replicas. 

When all the datasets are taken together, the initial x^ = 2.1, aheady quite reasonable is reduced 
down to x™ = 1-2. thus obtaining a very good overall description of all the most recent LHC electroweak 
data. The effective number of replicas for all combined datasets is only N^ff = 4 however: from this 
we conclude that to determine the combined impact of these data on PDFs would require many more 
replicas (around 25,000 in fact, to obtain reasonable statistical accuracy), or, more practically, a new fit. 
Note that the fact that the total effective number of replicas for the whole dataset is rather smaller than 
that of any individual subset confirms their mutual compatibility and the lack of any appreciable tension. 
Comparing the effective number of replicas for the individual datasets, the most constraining data are the 
ATLAS W and Z distributions, specially the very precise W~^ data. On the other hand the LHCb data 
have a rather small impact. 

Let us now examine how various PDFs change when new experiments are added. In particular 
the NNPDF2.1 NNLO d{x, Q^) and u{x, Q^ 



M^r as ratios to the 



we show in Fig. [15jthe NNPDF2.1 NNLO d{x, Q^) and u{x, Q'^) PDFs at Q^ 
central value before including the new data. As described above, we put together all the data from a 
given experiment. As can be seen, the ATLAS data give a moderate reduction in PDF uncertainties, 
and a somewhat softer small-x sea quarks, although the old and new PDFs agree at the 1-sigma level. 
For CMS the central values for the old and new PDFs are unchanged with a moderate error reduction at 
medium-x. Finally, for LHCb the PDF uncertainties are almost unaffected, due to the low constraining 
power of these datasets. 
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Fig. 14: Comparison between data and tlieory before and after reweigliting for NNPDF2.1 NNLO compared to 
the various LHC EW datasets considered. From top to bottom we show comparisons with ATLAS (W^ lep- 
ton Z rapidity distributions), CMS (W lepton asymmetry and Z rapidity distribution) and LHCb data (same as 
CMS). For the ATLAS data the error bars include statistical and systematic uncertaintes, but not the normalization 
uncertainties. 



8.3 Conclusions 

In this contribution we have quantified the impact of the most updated LHC electroweak data on the 
NNPDF2. 1 NNLO parton distributions. NNPDF2.1 provides a reasonable description of all these 
datasets even before their impact on the PDFs is included. We find that all the datasets are mutually con- 
sistent, with no obvious tensions. The PDF uncertainties for the light quarks and antiquarks at medium 
and small-x are moderately reduced. The ATLAS W, Z data seem to prefer a softer small-x sea. It is 
clear from our results that the LHC W and Z data should play an important part in any future PDF global 
fit. 
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Fig. 16: Fi vs. Q^ for the HERA combined measurements from HI and ZEUS i321ll . 



9. HEAVY QUARK PRODUCTION IN THE ACOT SCHEME AT NNLO AND N^LO [^ 

Abstract 

We extend the ACOT scheme for heavy quark production to NNLO and N^LO 
for the structure functions F2 and Fl in deep-inelastic scattering (DIS). We use 
the fully massive ACOT scheme up to NLO, and estimate the dominant heavy 
quark mass effects at the higher orders using the massless Wilson coefficients 
together with a generalized slow-rescaling prescription. We present results for 
F2 and Fi showing the effect of the higher orders and the contributions from 
the heavy flavors. 



9.1 INTRODUCTION 

The production of heavy quarks in high energy processes has become an increasingly important subject 
of study both theoretically and experimentally. The theory of heavy quark production in perturbative 
Quantum Chromodynamics (pQCD) is more challenging than that of light parton (jet) production be- 
cause of the additional heavy quark mass scale. The correct theory must properly take into account the 
changing role of the heavy quark over the full kinematic range of the relevant process from the threshold 
region (where the quark behaves like a typical "heavy particle") to the asymptotic region (where the same 
quark behaves effectively like a parton, similar to the well known light quarks {u, d, s}). 

With the ever-increasing precision of experimental data and the progression of theoretical calcula- 
tions and parton distribution function (PDF) evolution to next-to-next-to-leading order (NNLO) of QCD, 
there is a clear need to implement the heavy quark schemes at this order and beyond. The most important 
case is arguably the heavy quark treatment in inclusive deep-inelastic scattering (DIS) since the very pre- 
cise HERA data for DIS structure functions and cross sections form the backbone of any modern global 
analysis of PDFs. Here, the heavy quark structure functions contribute up to 30% or 40% to the inclusive 
structure functions at small momentum fractions x. Extending the heavy quark schemes to higher orders 
is relevant for extracting precision PDFs, and hence for accurate predictions of observables at the LHC. 

An example where higher order corrections are particularly important is the longitudinal struc- 
ture function F^ in DIS. The leading order 0{a^) contributions to this structure function vanishes for 
massless quarks due to helicity conservation (Callan-Gross relation). Since the first unsuppressed con- 
tribution to Fi is at next-to-leading order, the NNLO and N^LO corrections are more important than for 
F2. In Fig. 16 we show the preliminary results for the Fl measurement from the HI and ZEUS experi- 



ments [,32 1.1 . In Fig. [T7] displays sample Feynman diagrams at the various orders. Producing an accurate 
"Contributed by: T. Stavreva, I. Schienbein, F. I. Olness, T. Jezo, K. Kovafik, A. Kusina, J. Y. Yu 
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Fig. 17: Example Feynman diagrams contributing to DIS heavy quark production (from left): LO 0{a^g) 
quark-boson scattering QV — )• Q, NLO 0{a^g) boson-gluon scattering gV — > QQ, NNLO 0{a'^) 
boson-gluon scattering gV — )• gQQ and N^LO 0{ag) boson-gluon scattering gV — )■ ggQQ. 



prediction for F^ is a challenge, particularly in the region of low Q^ and small x. 

In this paper, we will briefly outline the method we used to incorporate the higher order terms, the 
key elements of the ACOT scheme, and the treatment of the heavy quark masses. We then present results 
for the F2 and Fl neutral current DIS structure functions. 

9.2 THE ACOT SCHEME AND ITS EXTENSION BEYOND NLO 



0.03 



0.02 



0.01 




10 20 

Q (GeV) 



100 



Fig. 18: Comparison of schemes for F| at x = 0.1 for NLO DIS heavy quark production as a function 
of Q. We display calculations using the ACOT, S-ACOT, Fixed-Flavor Number Scheme (FFNS), and 
Zero-Mass Variable-Flavor-Number- Scheme (ZM-VFNS). The ACOT and S-ACOT results are virtually 
identical. 



The ACOT scheme II322[ |323]| is based upon the factorization theorem for heavy quarks ['324^; 
hence, it is valid at any order of perturbation theory. The factorization proof ensures that the ACOT 
scheme can be applied throughout the full kinematic regime, and that there is a smooth transition from a 
massless result {m = 0) to the heavy-mass decoupling limit (m — )• 00). 

In the limit where the quark Q of mass m is relatively heavy compared to the characteristic energy 
scale (/x <m), the ACOT result naturally reduces to the Fixed- Flavor-Number-Scheme (FFNS). In the 
FFNS, the heavy quark is treated as being extrinsic to the hadron, and there is no corresponding heavy 
quark PDF, /q(x,/x) = 0. Conversely, in the limit where the quark mass is relatively light (/i > m), 
the ACOT result reduces to the MS Zero-Mass Variable-Flavor-Number-Scheme (ZM-VFNS) exactiy- 
without any finite renormalizations. In this limit, the quark mass m no longer plays any dynamical role; 



it serves purely as a regulator. This feature is presented in Fig. 18 where we can see that the ACOT 
scheme precisely matches the results of the FFNS and ZM-VFNS schemes in their respective limits. 

Additionally Fig. [18] shows the results obtained within the Simplified-ACOT scheme (S- 



ACOT) 113251 . The S-ACOT scheme drops the heavy quark mass dependence for the hard- scattering 
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Fig. 19: Fl vs. Q at N^LO for fixed x = {10"^ lO"'^, 10"^}, (left to right). The three lines show the 
mass effects via the scaling variable: n = {0, 1, 2}, (Red, Green, Blue). We observe the effect of the 
n-scaling is negligible except for small x and Q values. 



processes with incoming heavy quarks or with internal on-shell cuts on a heavy quark line. The S-ACOT 
scheme is not an approximation; it is an exact renormalization scheme, extensible to all orders. Note, the 
ACOT and S-ACOT results agree throughout the kinematic region. 



9.21 Beyond NLO 

While there is no conceptual difficulty with extending the ACOT scheme beyond NLO, the fully massive 
Wilson coefficients have yet to be computed}^ However massless calculations of NNLO and even N^LO 
for F2 and Fl structure functions are available}^ 

The question is: can we use these results, together with the knowledge that ACOT reduces to 
the massless MS (ZM-VFNS) for m -^ 0, to estimate mass effects at NNLO and N^LO? Obviously 
we cannot restore the fully massive ACOT result from the massless limit, but we can try to extract the 
dominant higher order contributions. There are two ways in which mass effects enter the calculation. The 
first is "dynamically" through the mass dependent Wilson coefficients. The second is "kinematically" 
via the restricted phase space. Comparisons using the fully massive results at NLO suggest that the 
kinematic mass effects are dominant, and that much of this dependence can be obtained with a rescaling 
of the Bjorken x variable. We introduce a generalized rescaling x ^ x[l + {nm/Q)'^] where n = is 
the massless result, n = 1 is the original Barnett li329ll rescaling, and n = 2 is the x-rescaling [|3301 . 

Thus, our strategy is as follows. We use the fully massive ACOT result to NLO f3TP\, and add 
to this the massless NNLO and N'^LO contributions using the generalized rescaling prescription. By 
varying n, we can investigate the influence of the kinematic mass in our results. We argue that the 
massless Wilson coefficients at NNLO and N^LO, together with the generalized rescaling prescription 
provide a good approximation of the exact result. At worst, the error is of order a a| x [m'^/Q'^], and 



comparative studies at NLO suggest the error is less{^ For example, in Fig. 19 we display the results 
of Fl for n = {0, 1, 2}. The effects of the detailed mass dependence is most noticeable for low Q^ and 
small X. While the massless scaling result (re = 0) does deviate from the other curves, comparing the 
re = 1 and n = 2 curves we observe the details of the mass rescaling are relatively small. While this is 
not a proof p] this result does give us confidence that the mass effects are under control. 
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Fig. 20: Fractional contribution for each quark structure function F^ ^ for each flavor i = {u, d, s, c, b} 
for (a) Fj and (b) F]^ vs. Q at N^LO for fixed x = {IQ-^, lO'^, IQ-^} (left to right) for the ACOT-x 
scheme. 



9.3 RESULTS 



In Fig. 20 we display the fractional contributions to the structure functions F2 and F^. At larger values of 
X and low Q, we observe that the heavy flavor contributions are minimal. For example, for x = 10^^, we 
see that the u-quark structure function F" comprises r^ 80% of the total structure function. In contrast, 
at X = 10^^ and large Q we see that the contributions of the u and c quarks are comparable (as they 
couple with a factor 4/9), and the d and s quarks contributions are comparable (as they couple with a 
factor 1/9). 



Figure 20 also shows how the x-rescaling introduces a damping of the heavy quark contributions 
as we move from large Q^ values to smaller values. The x-rescaling ensures the heavy quarks (c, b) are 
appropriately suppressed for low Q^ scales. 



In Fig. 21a we display the results for F2 vs. Q computed at various orders; the ratio to the N^LO 
result is displayed in Fig. |21b For large x (c.f. x = 0.1) we find the perturbative calculations are 
particularly stable. We see that the LO result is within 20% of the others at small Q, and within 5% at 
large Q. The NLO is within 2% at small Q, and indistinguishable from the NNLO and N'^LO for Q 
values above ~ 10 GeV. The NNLO and N^LO results are essentially identical throughout the kinematic 
range. For smaller x values (10^^, 10"^), the contributions of the higher order terms are slightly larger. 
Here, the NNLO and N'^LO coincide for Q values above ~ 5 GeV, but the NLO result can differ by 
~ 5% for low Q^ scales. 



In Fig. 22 we display the results for Fl vs. Q computed at various orders. In contrast to F2, we 
find that NLO corrections are large; this is expected because the LO corrections to Fl (which violate 



'^There has been a calculation of neutral current electroproduction of heavy quarks 0(0^) in the FFNS 13261 . however, 
extra contributions are still required for a VFNS calculation [327|. 

'''See Ref. |328| and references therein. 

'■* Details will be presented in a forthcoming publication. 

'^Of course, once the massive higher order Wilson coefficients have been computed, it is straightforward to incorporate these 
results into our calculations. 
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(a) F2 vs. Q at {LO, NLO, NNLO, N^LO} (red, green, blue, cyan) for fixed x = {10"\ 10"^, 10"'^}, (left to right) for 
ACOT-x scheme. 
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(b) Ratio of F2 vs. Q at {LO, NLO, NNLO, N^LO} (red, green, blue, cyan) compared to F2 at N^LO for fixed x 
{10"\ 10"^ 10"^}, (left to right) for ACOT-x scheme. 

Fig. 21: F2 vs. Q at {LO, NLO, NNLO, N^LO}. 



the Callan-Gross relation) are suppressed by (rra^/Q^) compared to the dominant gluon contributions 
which enter at NLO. Consequently, we observe that the LO result for Fi receives large contributions 
from the higher order terms. Essentially, NLO is the first non-trivial order for F^, and the subsequent 
contributions then converge. For example, at large x {c.f. x = 0.1) for (5 ~ 10 GeV we find the NLO 
results yields ~ 70% of the total, the NNLO is a ~ 20% correction, and the N^LO is a ~ 10% coiTcction. 
For lower x values (10"'^, 10~^) the convergence of the perturbative series improves, and the NLO results 
is within ~ 10% of the N^LO result. Curiously, for x = 10~^ the NNLO and N^LO roughly compensate 
each other so that the NLO and the N^LO match quite closely for Q >^2 GeV. 

9.4 CONCLUSIONS 

We have computed the F2 and Fl structure functions in the ACOT scheme at NNLO and N^LO. The 
full mass dependence is computed to NLO, and the dominant mass effects for the higher orders are 
approximated using a generalized rescaling; the details of this rescaling are demonstrated to be small. 
This allows us to make detailed predictions throughout the kinematic range investigated by HERA, and 
we obtain a reasonable estimate of the uncertainty due to the higher order mass effects. Together with 
the precise HERA data, these calculations facilitate accurate determination of the PDFs which are the 
foundation of the LHC calculations. 
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(a) Fl vs. Q at {LO, NLO, NNLO, N^LO} (red, green, blue, cyan) for fixed x = {10"\ 10~^, 10"^}, (left to right) for 
ACOT-x scheme. 
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Part IV 

PHENOMENOLOGICAL STUDIES OF 
OBSERVABLES AND UNCERTAINTIES 

10. FINITE- WIDTH EFFECTS IN TOP-QUARK PAIR PRODUCTION AND DECAY AT THE 

LHCB 

Abstract 

We investigate finite-top-width effects in top-quark pair production by com- 
paring NLO QCD predictions for pp — ;■ WWbb to corresponding pp — ;■ tt — ;■ 
WWbb results in the narrow-top-width limit. Finite-top-width effects, which 
result from non-resonant and off-shell contributions, are discussed in detail for 
the case of the inclusive cross section (with experimental cuts) and for selected 
differential observables in the di-lepton channel. 

10.1 INTRODUCTION 

Top-quark pair production at hadron colliders allows for key tests of the Standard Model and repre- 
sents an omnipresent background to Higgs-boson and new-physics searches. The very large tt samples 
from the Tevatron and the LHC, and the steadily increasing systematic precision call for a continu- 
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ous improvement of theory predictions q] In this context, a reliable theoretical description of exper- 
imental cuts and exclusive tt observables, which depend on details of the W^W^bb final state, re- 
quires higher-order calculations for top-pair production and decay. The first NLO QCD predictions for 
pp — >• tt — )■ WWbb + X II335[ 13361 I337II have been obtained in the narrow-top-width limit, an ap- 
proximation where the 2 — )• 4 particle process is factorised into on-shell tt production and (anti)top 
decays, taking into account spin correlations. In this framework, it was shown that NLO QCD ef- 
fects in top-quark decays have a significant impact on the kinematic properties of final-state leptons 
and b-jets 113351 13361 I337L and play an important role for top-mass measurements at the LHC B38I . 
More recently, NLO QCD predictions for the complete pp — ;■ W"*'W~bb + X process became avail- 
able ll25l l23l . which include all effects related to the finite top-quark width, i.e. on- and off-shell in- 
termediate top quarks, non-resonant contributions, and their interference with resonant tt production. 
Besides new evidence for the importance of NLO corrections to tt production and decay, these studies 
provided a first quantitative assessment of finite-width effects in the inclusive cross section. Applying a 
numerical Ft ^ extrapolation to the NLO pp — )■ W+W~bb predictions, it was found that finite-top- 
width contributions to the WWbb cross section at the Tevatron and the LHC (7 TeV) range from 0.2 to 
1 percent ||25ll23]| . which is perfectly consistent with the expected order of magnitude {rt/rrit ~ 0.9%) 
of finite-top- width effects in inclusive observables. 

In this study, we pursue the investigation of finite-top-width effects by means of a tuned compar- 
ison of the pp — )• W^^W^bb NLO calculation of Ref. [25] against the narrow-top-width approximation 
of Ref. [336|. This permits us, for the first time, to investigate Tt-effects in different phenomenologi- 
cally interesting regions of the WWbb phase space, where large off-shell and non-resonant contributions 
cannot be excluded a priori as in the case of inclusive observables. 

10.2 NARROW-TOP- WIDTH APPROXIMATION AND FINITE-WIDTH EFFECTS 

Let us start by recalling the main features of the NLO QCD calculations of pp — )• tt — )• W+W~bb 
in narrow-top-width approximation |336| and pp — ;■ W"'"W~bb with finite-top-width effects ||25]| . For 
brevity, we denote them as tt and WWbb calculations, respectively. Both calculations implement lep- 
tonic W-boson decays in spin-correlated narrow-W-width approximation. 

In the narrow-top-width limit of Ref. [|336|| , top-quark resonances are approximated by 

1™ T-^ o^ oTvT = wHVf — ""T-t)) (77) 

n/mt^o (p2 - m2)2 + m^r^ mtA ^^^ t^' 

with delta functions that enforce the on-shell conditions, pi = m^ , and are accompanied by 1/ Ft factors. 
Contributions of ©(il/rrit), i.e. terms that do not involve two resonant top propagators, are systemati- 
cally neglected. The differential pp — ;■ tt — ;■ WWbb cross section is factorised into the pp — ;■ tt cross 
section times t — )• Wb partial decay widths, da = (dcrtt dildl^t) l^t' taking into account top-quark 
spin correlations. The LO and NLO predictions can be schematically expressed as 

dcTNLo = A>\o [(d4 + d4) dr^drf + d4 (dr.Mrf + dr^dr-l)] , (78) 

where the superscripts and 1 indicate tree-level quantities and NLO corrections, respectively. The NLO 
prediction involves three terms, where the corrections are applied either to da^i or to one of the decays. 
All ingredients of dciLo and duNLo have to be evaluated with input pai^ameters at the corresponding per- 
turbative order. In particular, LO and NLO predictions must be computed using il^LO and A.nlo decay 



widths, as indicated in (78 1 '^This guarantees that — up to higher-order corrections — the integration over 



'^Recent progress in the theoretical description of top-quark pair production at hadron colliders is reviewed in Refs. 13321 
[333ll334l. 

'^ We note that in the present study the factor _r^~]^Lo in 1 78 1 is not expanded as (-rt,LO + ^t^) ~ ^t LoC^^^-Tt^Z-rt.Lo), 



like in Eq. (6) of Ref. 13361 . since this procedure is not directly applicable to the full WWbb calculation. 
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the phase space of each top decay in (78 1 is consistent with the branching fraction 



n 



n 



BR(t -^ hlu) 



(79) 



In this context, let us point out that a consistent inclusion of finite-W- width corrections — ^both in the scat- 
tering amplitudes and the Ft input parameters — is expected to lead to doubly-suppressed effects. This 
is due to the fact that, in the Tt — ;■ limit, 0(1^) corrections to the numerator and denominator of 
the branching fraction ([79]) cancel. Finite-W-width corrections are thus expected to produce very small 
effects of Ol 



in inclusive observables. This justifies the use of the narrow-W-width approxi- 



Mwint ■ 

mation in combination with finite-top-width contributions, which is the approach adopted in Ref. II25I . 
although in kinematic regions where finite-rt effects become large also finite-W-width corrections ll23l 
might become non-negligible. 

The calculation of Ref. ||25l provides a full description of pp — )• W+W~bb at order O(aga^). 
The top-quark width is incorporated into the complex top mass, fj,"^ = ml — irn-t-Tt, in the complex-mass 
scheme II339I . In this way, off-shell-top contributions are consistently described by Breit-Wigner dis- 
tributions. Besides contributions with two intermediate top resonances, also singly- and non-resonant 
diagrams are taken into account, including interferences. A few representative tree diagrams are shown 
in Fig 23 The NLO WWbb predictions involve factorisable corrections to doubly-resonant diagrams. 



which provide the off-shell extension of NLO corrections in tt approximation (78 1. In addition, there are 
non-factorisable corrections, where tt production and decay parts of the process are connected via ex- 
change of QCD partons, and NLO corrections to singly- and non-resonant topologies. Further technical 
aspects are discussed in the original publications 113361 |25l . 





Fig. 23: Representative LO diagrams of doubly-resonant (upper line), singly-resonant (first diagram in 
lower line), and non-resonant type (last two diagrams in lower line). 



10.3 NUMERICAL RESULTS 

10.31 Input parameters and setup 

In the following we compare tt and WWbb predictions for W^(— ;■ t'ce+)W^(— ;■ /i^i>^)bb produc- 
tion at the Tevatron (pp collisions at 1.96 TeV) and the LHC (pp collisions at 7 and 14TeV). These 
results are based on the same input parameters and cuts as in Ref. 1251 . In NLO (LO) QCD we employ 
MSTW2008NLO (LO) parton distributions [2621 and describe the running of the strong coupling con- 
stant as with two-loop (one-loop) accuracy, including five active flavours. Contributions induced by the 
strongly suppressed bottom-quark density are neglected. For the gauge-boson and top-quark masses we 
use mt = 172 GeV, Mw = 80.399 GeV, and Mz = 91.1876 GeV. The masses of all other quarks, 
including b-quarks, are neglected. In view of the negligibly small Higgs-mass dependence we adopt the 
Mh — )• oo limit, i.e. we omit diagrams involving Higgs bosons. The electroweak couplings are derived 
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Collider ^/s [TeV] approx. 



'^tt [fb] 



a- 



WWbb 



[fb] 



c^tt/^- 



WWbb 



1 Ref. f25\ 



Tevatron 


1.96 


LO 


44.691(8)11°:^ 


44.310(3)1}°:68 


+ 0.861(19)% 


+ 0.8% 






NLO 


42.16(3)1°:°° 


41.75(5)1°:°° 


+ 0.98(14)% 


+ 0.9% 


LHC 


7 


LO 


659.5(1)1??^:? 


662.35(4)1263-4 


- 0.431(16)% 


- 0.4% 






NLO 


837(2)lf7 


840(2)1^1 


-0.41(31)% 


- 0.2% 


LHC 


14 


LO 


3306.3(l)l^°f68 


3334.6(2)1^°^/ 


- 0.849(7)% 









NLO 


4253(3)1282 


4286(7)1283 


- 0.77(19)% 






Table 8: Integrated fee+//^z^/ibb cross section in narrow- with approximation (dtt) and including finite- 
top-width effects (cT^^^-^bb)- The relative error of the narrow-width approximation (sixth column) is 
compared to the prediction of Ref. ll25l (seventh column). Factor-two scale variations in a^i and (7^^^^^^ 
are shown as sub- and super-scripts, while statistical errors are given in parenthesis. 



from the Fermi constant G^ = 1.16637 x 10^ GeV^ in the G^-scheme, where the sine of the mix- 
ing angle and the electromagnetic coupling read s^ = 1 — M^/AI^ and a = \/^G^M^s^/it. For 
consistency, we perform the LO and NLO calculations using the top-quark widths /I^lo = 1-4655 GeV 
and -Tt NLO = 1-3376 GeV B40II . respectively. Since the leptonic W-boson decay does not receive NLO 
QCD corrections we employ the NLO W-boson width /V = 2.0997 GeV everywhere. 

Final-state quarks and gluons with pseudo-rapidity \7]\ < 5 are converted into infrared-safe jets us- 
ing the anti-ZcT algorithm [3411 . For the Tevatron (LHC) we set the jet-algorithm parameter R = 0.4 (0.5) 
and apply the transverse-momentum and pseudo-rapidity cuts px.b-jct > 20 (30) GeV, 1%-jetl < 2.5. 
Moreover, we require a missing transverse momentum of pT,miss > 25 (20) GeV and charged leptons 
withpT.i > 20 GeV and \r]i\ < 2.5. 

For the renormalisation and factorisation scales we adopt the central value jjl = mi and study 
factor-two variations of ^u = /iron = /^fact^ i-C- we compare predictions at /x/?7it = 0-5, 1, 2. The scale 
variations are applied also to -Tt.NLO. but not to /\v- 



/x f^bb cross sections and scale uncertainties at the Tevatron and the LHC 



10.32 Integrated cross section 

Results for the integrated u^^ 
are reported in Table [8] While the (T^^^^^^ results for Tevatron and LHC at 7 TeV correspond to those of 
Ref. [25 qJ the ones for LHC at 14 TeV as well as all a^i predictions are new. Comparing all WWbb 
and tt predictions we find that finite-top-width effects never exceed one percent, both in LO and NLO. 
The statistical precision of the calculations permits us to assess the error of the NWA, Ctt/c^^j^g — 1, 
with an accuracy of 1-3 permille. At the Tevatron, the NWA overestimates the WWbb cross section by 
an amount very close to Tt/mt ~ 0.9%, both in LO and NLO. The error of the NWA at the 7(14) TeV 
LHC ranges between 4 and 8 permille. As shown in the last column of Table [8} these finite- width effects 
are in very good agreement with the results of the I^t — ^ extrapolation in Ref. [25]. Similar results can 
be found also in Ref. 1.23.1 . 



"To be more precise, in Ref. [25| the scale dependence was assessed using a fixed Ft input, while here we take into account 
the ^-dependence of -Tt^NLOi which results into slightly different u^y^^^^ variations at NLO. 
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10.33 Differential distributions 

The small finite-width corrections to the integrated cross section demonstrate that — in presence of stan- 
dard LHC and Tevatron cuts — the NWA provides a fairly accurate description of inclusive WWbb pro- 
duction. It is thus interesting to investigate to which extent this conclusion applies to the various phe- 
nomenologically important regions of the WWbb phase space. To this end we have compared tt and 
WWbb predictions for a few differential observables that are relevant for top-pair production, either 
as signal or as background to Higgs production or new physics. Note that we refrain from selecting 
kinematic variables like the top-quark invariant mass or imposing cuts of type Mwb > 200 GeV, which 
would lead to obvious enhancements of non-resonant contributions. 



In Figs. 24 - 27 we present predictions for some invariant-mass and transverse-momentum distri- 
butions, restricting ourselves to the case of the 7 TeV LHC. For each observable we display tt (dashed 
curves) and WWbb (solid curves) results in LO (blue) and NLO (red) approximation. Absolute pre- 
dictions (left plots) are complemented by the ratios (ddLo — d(TNLo)/do'NLO (upper right plots) and 



(dcTtt-do- 



WWbb 



)/da- 



WWbb 



(lower right plots), which indicate the relative error of LO and narrow-width 



approximations w.r.t. the best predictions, i.e. NLO and WWbb. 
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Fig. 24: Distribution in the transverse momentum of the harder b-jet at the 7 TeV LHC: LO (blue) and 
NLO (red) predictions in narrow-width approximation (tt, dashed) and including finite-top- width effects 
(WWbb, solid). Plotted are absolute predictions (left) and relative deviations of LO (upper-right) and 
narrow-width (lower-right) approximations w.rt. NLO and WWbb predictions, respectively. 



The transverse-momentum distribution of the harder b-jet is shown in Fig. 24 In the range below 



200 GeV, which contains the bulk of the cross section, the NLO and finite-width corrections behave 
similarly as for the integrated cross section: LO predictions deviate from NLO ones by about —20%, and 
the error of the NWA ranges between +1 and —4%. Finite- width effects tend to increase with px and 
reach the 10% level around 300 GeV. Within the entire px range the LO/NLO ratios resulting from the 
tt and WWbb calculations are almost equal. Equivalently, we find the same do-tt/dcr^^^^ ratios in LO 
and NLO. 
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Fig. 25: Distribution in the transverse momentum of the bb di-jet system at the 7TeV LHC: LO (blue) 
and NLO (red) predictions in narrow-width approximation (tt, dashed) and including finite-top-width 
effects (WWbb, solid). Plotted are absolute predictions (left) and relative deviations of LO (upper-right) 
and narrow-width (lower-right) approximations w.r.t. NLO and WWbb predictions, respectively. 



In Fig. 25 we show the transverse-momentum distribution of the bb di-jet system. This kinematic 



variable plays an important role in boosted- Higgs searches with a large tt background. In particular, the 
strategy proposed in Ref. II342II to extract a pp — )• H(— )• bb)W signal at the LHC is based on the selection 
of boosted H — ;■ bb candidates with prp ^^^ > 200 GeV, which permits to reduce tt contamination (and 



other backgrounds) in a very efficient way. As can be seen from Fig. 25 the suppression of tt production 



is indeed particularly strong at pr^ j,^ > 150 GeV. This is due to kinematic constraints that characterise 
the LO and narrow-width approximations: in order to acquire pT,b > ("^t ~ M^)/{2'mt) ~ 65 GeV 
b-quarks need to be boosted via the px of their parent (anti)top quarks, and the fact that top and antitop 
quarks have opposite transverse momenta (at LO) makes it difficult to generate a bb system with high 
Pt- The NLO and finite-width corrections undergo less stringent kinematic restrictions, resulting into 
a significant enhancement of WWbb events at large pr^ ^y,- This is clearly reflected in the differences 



between the various curves in the left plot of Fig. 25 The most pronounced effect comes from the NLO 
corrections, where the tt system can acquire large transverse momentum by recoiling against extra jet 
radiation. As indicated by the right-upper plot, the NLO correction represents 50-80% of the cross 
section at high p^, corresponding to a huge K-factor of 2-5. Finite- width effects (lower-right plot) lead 
to a further significant, although less dramatic, enhancement; for example, non-resonant topologies can 
lead to direct bb production via high-px gluons that recoil against W^ W^ pairs. For pr^ ^^^ > 200 GeV, 
we find that 20-40% of the LO WWbb cross section is due to finite-width contributions, while this 
fraction decreases to 7-15% at NLO. This reduction is related to the dominance of the jet-emission 
contribution, which we expect to be rather well described by the NWA. On the other hand, an optimal 
suppression of the tt background will require a very tight jet- veto 113421 . and in this case we expect 
finite-width corrections to the NLO tt predictions to be as large as in LO. 

The distribution in the missing transverse momentum, i.e. the vector sum of the u^. and P^ trans- 
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Fig. 26: Distribution in the missing transverse momentum at the 7 TeV LHC: LO (blue) and NLO (red) 
predictions in narrow-width approximation (tt, dashed) and including finite-top-width effects (WWbb, 
solid). Plotted are absolute predictions (left) and relative deviations of LO (upper-right) and narrow- 
width (lower-right) approximations w.r.t. NLO and WWbb predictions, respectively. 



verse momenta, is displayed in Fig. 26 This distribution is relevant for new-physics searches based on 
missing transverse energy plus jets and leptons. Its tail features a qualitatively similar behaviour as in 
the case of prp ^y^, due to analogous kinematic constraints. However, in the case of px.miss the corrections 
are less pronounced: the NLO correction does not exceed 40-50% of the full prediction, and finite-width 
contributions stay below roughly 10%. 

Figure [27] displays the distribution in the invariant mass of the positron and a b-jet, i.e. the visible 
products of a top-quark decay. More precisely, assuming that the charge of the b-jet is not known, the 
e"'"b pair is built by selecting the b-jet that yields the smallest invariant massj^In narrow-width and LO 
approximation this kinematic quantity is characterised by a sharp upper bound, M^+^ < ml — M^ ~ 
(152 GeV)^, which renders it very sensitive to the top-quark mass. The value of mt can be extracted 
with high precision using, for instance, the invariant-mass distribution of a positron and a J/ip from a B- 
meson decay II3431I338I . an observable that is closely related to Mg+b. In the region below the kinematic 
bound, the NLO corrections to M^+t, vary between 5-30%, and the impact of the NLO shape distortion 
on a precision rrit-measurement is certainly significant. For M^+j, < 150 GeV, the NWA agrees with 
the WWbb predictions at the 1% level or better. In contrast, in the vicinity of the kinematic bound the 
impact of finite-width (and NLO) corrections becomes clearly more important, giving rise to a tail that 



M^. The resulting contribution to the total cross section is fairly small. 



extends above M^+j^ = m^ 

but the impact of such finite-width effects on the top-mass measurement might be non-negligible, given 

the high rrit-sensitivity of the Af^^^ ~ ml — M^ region. 

^^ Note that the i\fo+b distribution in Ref. (255 was defined by selecting (based on the Monte Carlo truth) jets that involve 
negatively charged b-quarks, such that the e^b pairs are consistent with top-quark decays. 
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Fig. 27: Distribution in the invariant mass of the positron-b-jet system (as defined in the text) at the 7 TeV 
LHC: LO (blue) and NLO (red) predictions in narrow-width approximation (tt, dashed) and including 
finite-top-width effects (WWbb, solid). Plotted are absolute predictions (left) and relative deviations 
of LO (upper-right) and narrow-width (lower-right) approximations w.r.t. NLO and WWbb predictions, 
respectively. 



10.4 CONCLUSIONS 

Based on recent NLO QCD calculations, we have presented a systematic comparison of top-pair pro- 
duction and decay in narrow-top-width approximation, pp — )• tt — )■ WWbb, against the complete 
pp — )• WWbb process, which involves finite-top-width effects of non-resonant and off-shell type. 

At the Tevatron and the LHC (7 and 14 TeV), finite-top-width contributions to the integrated cross 
section (in the di-lepton channel) turn out not to exceed one percent. This confirms previous estimates 
based on the /^ — )■ extrapolation of pp — )• WWbb predictions. At the 7 TeV LHC, we also investigated 
differential observables that are relevant either for top-pair production as a signal or as a background in 
Higgs or new-physics searches. In the case of the b-jet transverse momentum and PT,miss distributions, 
finite- width effects remain very small over a large kinematic range and reach the 10% level only around 
300 GeV. In contrast, the pT-distribution of the bb di-jet system receives il^ -corrections beyond 20-30% 
for prp ^j^ > 200 GeV, a kinematic region that plays an important role in pp — )• H(— )• bb)W searches 
based on boosted H — )• bb candidates. For the lepton-b-jet invariant-mass distribution — an observ- 
able that provides high sensitivity to the top-quark mass — finite-width corrections do not exceed one 
percent in the range that contains the bulk of the cross section, but become more sizable in the region 
of highest ?TT,t- sensitivity. This motivates more detailed studies of finite-width effects in the context of 
high-precision mt -measurements at the LHC. The results of this investigation of finite-width effects in 
tt production give also useful insights into possible limitations of treating associated top-pair production 
processes in the narrow-width approximation, since NLO calculations for pp — ;■ WWbbj and similar 
reactions will not be available too soon. 
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11. Strong and Smooth Ordering in Antenna Showers p^ 

Abstract 

We comment on strong and smooth ordering in antenna showers, and extend 
the definition of smooth ordering to include the case of g ^ qq splittings. We 
define three observables in hadronic Z decays that can be used to probe the 
subleading properties of shower models. 

11.1 INTRODUCTION 

Traditional parton showers are based on coUinear factorization, and the shower evolution proceeds via 
1 — ;■ 2 branchings, on which additional constraints have to be imposed to ensure momentum conser- 
vation and QCD coherence (see 113441). Antenna showers are instead based on momentum-conserving 
and intrinsically coherent 2 — )■ 3 branchings, as pioneered by Ariadne 11345 [1346.1 . This note concerns 
the antenna shower implementation in the Vincia code II347II . a plug-in to Pythia 8 II348II . though we 
emphasize that the notion of smooth ordering could be applied to other shower types as well. 

In leading-logarithmic (LL) antenna showers, the fundamental step is a Lorentz-invariant 2 — )^ 3 
branching process by which two on-shell "parent" partons are replaced by three on-shell "daughter" 
partons. This 2 — )• 3 process makes use of three ingredients II349II : 

1. An antenna function that captures the leading tree-level singularities of QCD matrix elements. 

2. An antenna phase space — an exact, momentum-conserving and Lorentz-invariant factorization 
of the pre- and post-branching phase spaces. 

3. A kinematics map, specifying how the global orientation of the post-branching momenta are re- 
lated to the pre-branching ones. 

Antenna showers come in two varieties: global and sector. The two kinds differ in how the 
coUinear singularities of gluons are partitioned among neighboring antennae, see ||350[ 13511 . Here, we 
shall only be concerned with the global type II345[ 13521 13471 [349 1 . in which the gluon-collinear singular- 
ity is partitioned such that two neigbouring antennae each contain "half" of it; their sum reproduces the 
full singularity. 

If each antenna in a global shower is allowed to emit in its full phase space, the resulting shower 
evolution amounts to an incoherent addition of independently radiating dipoles. This tends to overcount 
regions in which several dipole terms contribute at the same level, i.e., in regions where dipole-dipole 
interference effects (or, equivalently, multipole effects) are important [:353[[35l1l . The situation is anal- 
ogous to, though less severe than, the case of traditional parton showers with virtuality-ordering [[3541 . 
which represent an incoherent addition of independent monopoles. In parton/monopole showers, multi- 
parton interference effects for soft radiation can be taken into account by the requirement of angular 
ordering [.355] . while in dipole/antenna showers, typically a measure of transverse momentum is used, 
such as 

pIa="-^^. (80) 

SIK 

for a branching IK — )• ijk, with Sab = 2pa • ph = {pa + Pb)"^ for massless partons. Some alternative 
possibilities are compared in [[3491 . 



^'Contributed by: J. J. Lopez- Villarejo, P. Skands 
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11.2 STRONG AND SMOOTH ORDERING 

In a strongly-ordered shower, each consecutive branching is required to occur at a lower scale in the 
evolution variable than that of the previous one: (5„+i < Qn- This can be represented as a step function 
in the evolution variable, multiplying the branching kernels. In a smoothly-ordered shower |349|, the 
step function is replaced by a smooth dampening factor designed to leave the soft and coUinear limits 
unchanged while suppressing radiation at scales above ~ Qn- Specifically, for evolution in p±, we 
replace the strong-ordering condition as follows, 

0ip±-p±)PLL -^ P^mpPLL = .^ ^^ 2 ^LL , (81) 

P±+P± 



where p± characterizes the scale of the previous branching^ p± is the scale of the emission under 
consideration, and Pn is an ordinary LL shower kernel, which in our case is represented by a gluon- 
emission antenna function. (We return to the case of g ^ qq below.) 

Thus, for p_i <^ p± (the strongly-ordered limit) the smooth-ordering factor Pimp tends to unity, 
while for p± ~ p± (the ordering threshold) it tends to 1 /2, and finally for p± » p± (highly unordered), it 
tends to zero oc p^/p^. Note that, since Pll is likewise oc l/p^, the net effect of the suppression factor 
is to modify the behavior of the splitting kernel from l/p^ in the strongly-ordered limits to l/p\_ for 
highly unordered branchings, similar to what has been studied for initial-state parton showers in II356II ; 
above the strong-ordering threshold, the branching probability is explicitly suppressed beyond LL. 

For a rigourous interpretation of the Pimp factor one would have to analyze the 2 — )• 4 antennae 
[[791 and check that the combination of two 2 — ;• 3 antennae times this factor does indeed reproduce 
subleading aspects of the full 2 — )• 4 function. In the absence of such a study, one may still physically 
interpret its purpose in the following way: the LL antenna functions are derived assuming the outgoing 
partons/jets to be massless. This is a good approximation if the virtuality that they can acquire (through 
further showering) is restricted by the strong-ordering threshold. When allowing unordered branchings, 
however, the corresponding Feynman diagrams contain highly off-shell propagators, which the Pimp 



factor attempts to mimic by introducing an "effective mass" in the denominator of eq. (81 1. 

For gluon emissions, it was shown in 113491 that the smooth-ordering condition does lead to a sys- 
tematic improvement in the shower. Since it simultaneously guarantees a complete phase-space coverage 
(contrary to the case for strong ordering B57[|349ll ). it is the default option in Vincia. 

Antenna showers including g ^f qq splittings were studied in [|3581 . in which evolution in rrigg 
was introduced for such branchings. This is based on the observation II359II that the scale controlling the 
divergences oi g ^- qq splittings is the invariant mass of the pair, not its p±. By analogy with the physical 
interpretation given to the Pimp factor for gluon emissions above, it therefore seems well-motivated to 
study a "generalized" Pimp factor where each scale depends on whether we are dealing with a gluon or 
a quark: 

where Qe is the evolution variable: p± for gluons and invariant mass for quark- antiquark pairs. 

We can assess the improvement that this produces in the shower by plotting the ratio of the shower 



approximation vs. the LO matrix element for Z — )• qq'q'q and Z — )• qq'q'gq. This is shown in fig. 28 
where the histograms represent the distribution of log^^o (PS/ME) in a flat phase-space scan, normalized 
to unity (i.e., the same type of distributions that were shown in ||349[ 13581 135 ID . Points to the left of 
zero are undercounted by the shower approximation, while points to the right are overcounted. Although 
the agreement is by no means perfect, we do observe a slight improvement in the shower approximation 



^^We take p± to be the smallest p± scale among all the color-connected partons in the parent configuration, i.e., a global 
measure of the "current" p± scale of that topology. This makes the shower a true Markov chain (i.e., history-independent) 
which has beneficial consequences for matching to matrix elements 13491 . 
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Fig. 28: Comparison between generalized Pimp and "old" Pimp factor in the global shower approxima- 
tions to LO matrix elements, for processes involving a. g ^f qq splitting. Left: Z — ;■ qq'q'q. Right: 
Z — )• qq'q'gq. In both cases, GKS matching to the LO matrix element for the preceding multiplicity 
(Z — )• 3 and Z — )• 4, respectively) has been included, and the Ariadne factor was applied to g ^ qq 
splittings. 

when the Pimp factor is defined in terms of Qe (solid black histogram), as compared to the definition 
used previously (dashed histogram). Note that we used the so-called Ariadne factor in the shower ap- 
proximation for all cases, see [,358,1 . and that the distributions were made including GKS matching to the 
preceding multiphcities II349II . 



11.3 SENSITIVE OBSERVABLES IN HADRONIC Z DECAYS 

The properties of shower and matrix-element matching algorithms are coming under increasing scrutiny, 
not least due to the desire of achieving reliable descriptions of jet production and jet properties, such as 
jet substructure, for signal and background estimates at the LHC. 

For final-state radiation, i.e., jet broadening and jet splitting, hadronic Z decays are the main ref- 
erence, with a large set of events shapes and jet resolutions/rates being used to constrain and tune shower 
algorithms (see, e.g., B601I344I ). However, in the logarithmically dominated regions, these observables 
are typically dominated by leading logs, and are well described by all coherent and reasonably well-tuned 
shower algorithms on the market. In order to probe the subleading properties in a more dedicated way, 
we have found the following three simple observables useful, each designed to isolate a specific aspect. 

We consider hadronic Z events (photon ISR is switched off, and matching beyond 3 jets is 
switched off for the strongly-ordered showers) and use the kx clustering algorithm B61I to cluster all 
events back to two jets. The 3 — )• 2 clustering scale is denoted 2/23 = ^tz/'^z^ ^'^'^ ^° ^'^ ^^^ higher 
jet numbers. We require all yij entering in the observables below to be greater than 0.005, to remove 
contamination from B decays and lower scales. Since the original topology contains two jets, we also 
keep track of which "side" each clustering happens on. Strong ordering corresponds to 1/23 ^ 2/34 ^ ■ ■ ■, 
while events with, e.g., 2/34 ~ 7/23 should be more sensitive to the ordering condition and to the effective 
1 — >• 3 spliting kernels. 

The first observable is thus simply the ratio 2/34/7/23, in events where the 4 — )• 3 and 3 — )• 2 



clusterings happen in the same jet. This distribution is illustrated in the left-hand pane of fig. 29 with 
logarithmic axes. Vertical error bars indicate the expected la statistical error with 400k hadronic Z 
decays. Since the kr algorithm allows for unordered clustering scales, the distribution extends beyond 
^24 = In(y34/y24) = 0. Default Pythia (thick solid line) is compared to three different Vincia settings: 
smooth (thin solid) and strong (dashed) ordering in px and strong ordering in dipole virtuality, niD 
(dotted). Note here that ordering in the variables p^ or mD does not directly imply ordering in kx- 
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Fig. 29: Left: ^24 = 111(2/34/^23) in "same-side" 4-jet events. Right: Ratio of jet masses, m'j^/m'jj, 
in "compressed" 4-jet events. Error bars indicate expected la statistical errors with 400k hadronic Z 
decays. 



The fact that the Pimp factor also suppresses branchings slightly below the strong-ordering threshold is 
manifest in the thin solid line lying below the other ones in the region just below zero, which should be 
statistically significant with a sample size of ~ 0.5M events. Note as well that these distributions become 
indistinguishable if one does not make the requirement of sameside clustering (not shown), presumably 
since opposite-side coUinear splittings then dominate. 

A related observable is shown in the right-hand pane of fig. |29] To force a "compressed" scale 
hierarchy, we impose the cut 7/34 > 0.5 y23» and plot the ratio Mj^/M'^ of the masses of the jets at the 
end of the clustering. With four partons at LO, the light jet mass is zero if both the 4 — )• 3 and 3 — )• 2 
clusterings happen in the same jet, while it is non-zero otherwise. Thus, the region close to zero isolates 
events with a 1 — ;■ 3 splitting occurring in one of the jets, while the region above ~ 0.25 is dominated by 
opposite-side 1 — )• 2 splittings. In Pythia and in mass-ordered Vincia, the peak at zero is stronger than 
in the px -ordered Vincia cases, while there is no difference between strong and smooth ordering in this 
variable. It thus serves as a useful complement to ^24- 

Finally, in fig. [30j we consider 4-jet events in which the second and third jets (ordered in energy) 
are nearly collinear and back-to-back to the hardest jet. Specifically, we impose the cuts 6*12 > 120°, 
6*13 > 120°, and 6*23 < 30°. We then plot the angle of the fourth (softest) jet with respect to the hardest 
one. Again the strong and smooth ordering options are indistinguishable, but interesting differences 
with respect to both Pythia and mass-ordered Vincia are visible. Mass-ordering tends to produce a 
broader distribution, with more radiation at right angles to the hardest jet (consistent with mass-ordering 
prioritizing wide-angle emissions over collinear ones), and the p^ -ordered Vincia showers exhibit a 
stronger collinear peak than the Pythia one. A similar observable was proposed in II362II . 

We conclude that, if all three observables could be measured with an accuracy of ~ 5 — 10% or 
better, a useful and multi-dimensional constraint on the subleading shower aspects would be obtained, 
including sensitivity both to the type and shape of the ordering condition, and to the form of the effective 
1 — )• 3 probabilities produced by the shower. We emphasize that we have here restricted our attention to 
shower models that are virtually indistinguishable on all other observables we have considered. 
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12. PERTURBATIVE UNCERTAINTIES AND RESUMMATION FOR EXCLUSIVE JET 

CROSS SECTIONS B 
12.1 Introduction 

In this writeup we discuss predictions for exclusive jet cross sections, which have a particular number 
of jets in the final state. There are several motivations for analyzing events by dividing the data into 
exclusive jet bins, in particular when the relevant backgrounds strongly depend on the number of jets, 
or when the sensitivity can be increased by optimizing the analysis for the individual jet bins. As our 
primary example we will consider the Higgs analysis in the H — )■ WW channel, which is performed 
separately in exclusive 0-jet, 1-jet, and 2-jet bins 11363 [ 1364"! 13651 . Other examples are vector-boson 
fusion analyses, which are typically performed in the exclusive 2-jet channel, boosted H ^ bb analyses 
that include a veto on additional jets, as well as //—)• rr and i7 — )• 77 which benefit from improved 
sensitivity when the Higgs recoils against a jet. The importance of the Higgs -1- 1 jet channel in H ^ tt 
and H — ;■ WW* was demonstrated explicitly in Refs. II366[ 1367 1. Another motivation for studying 
exclusive jet bins are the W+ jets channels, which are important backgrounds for new physics searches. 
We will use the notation a^ for an exclusive A^-jet cross section (with exactly N jets), and the notation 
a>N for an inclusive A^-jet cross section (with N or more jets). 

To explore the implications of the jet bin restrictions, consider a simple example where we divide 
the total cross section, cxtotah into an exclusive 0-jet bin, (To(p'^"*), and the remaining inclusive (> l)-jet 
bin,c7>i(ij'=-*). 



O'total 



fP'"' da 



Jo 



dp 



dp 



J p^ 



ut 



dp 



CUtN 



(83) 



Here p denotes the kinematic variable which is used to divide up the cross section into jet bins. A 
typical choice is p = p^^ , defined by the largest px of any jet in the event, such that a^ip'^^) only 
contains events with jets having pT < p™*, and (T>i(p™*) contains events with at least one jet with 
Pt > Pt"*- By defining aQ{p^^) and o">i(p^*) one has divided up initial-state radiation from the 

^''Contributed by: Iain W. Stewart, Frank J. Tackmann 
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colliding hard partons and soft radiation in the event. This restriction on additional emissions changes the 
coefficients appearing in the ag expansion and leads to the appearance of double and single logarithms 
of the form as\v?{p™^ /Q) and a^ ln(p^"'/Q) (with higher powers a"ln'"-^"(p"^^*/(5) appearing at 
higher orders in perturbation theory). Here Q is the hard scale of the process, such as Q = rriH for Higgs 
production, and most often we have p^^'^ <C Q. These changes to the perturbation series can modify the 
convergence of fixed-order results and make it prudent to consider resummed cross section predictions 



that include an all-orders resummation of the large logarithms. For N jets the analog of Eq. ( 83 1 is 
cr>Ar = (JNip'^+i) + f^yN +i{p^nXi) ^"^^ ^^^ same discussion apphes regarding the large logarithms of 
p^* that are not present in cr>Ar, but are present in each of a^ and a>N+i- 

The definition of (7o(p'^"*) may include dependence on rapidity and on the grouping of particles. 
For a jet-based variable like j?^ the former is induced by only considering jets within the rapidity range 
|,^jet| ^ ^cut^ ^j^j j-j^g latter enters through the choice of jet algorithm. These dependencies make the- 
oretical predictions more difficult. In Higgs production via gluon fusion the cross section is known to 
next-to-next-to-leading order (NNLO) |l208l[3Mll2Q7l|209l|2l0l|2n][369l[37a, and NNLO results in- 
cluding full kinematic information are available through FeHiP ll2T4l[37T | and HNNLO f90'.'2r6| (as well 
as by combining the total NNLO cross section with MCFM II1571 I372]| for some distributions). When the 
measurements are performed in exclusive jet bins, the perturbative uncertainties in the theoretical pre- 
dictions must also be evaluated separately for each individual jet bin II373II . When combining channels 
with different jet multiplicities, the correlations between the theoretical uncertainties can be significant 
and must be taken into account 1261 . The perturbative predictions can be made more precise by including 
a resummation of large p'^"* dependent logarithms on top of the fixed-order predictions. At the leading 
logarithmic level this can be achieved with standard parton shower Monte Carlo programs, regardless of 
the precise definition ofp'^^^. So far a next-to-next-to-leading logarithmic (NNLL) resummed result for 
a jet- veto variable only exists for beam thrust [3741 . Tcm, which is a rapidity weighted E'^-like inclusive 
variable. The definitions of the jet- veto variables we will use are 

J^T = I E ^>4 'T'- = E \PTk\e-\'^'^ = ^{Ek - \pl\) . (84) 

fcsjet k k 

For p'^ our jets are defined using anti-fcy l'3411 with R = 0.5, and we consider jets that satisfy a rapidity 
cut \ri\ < ri'^^^. For Tcm the sum is over all objects in the final state except the Higgs decay products, and 
can in principle be considered over particles, topo-clusters, or jets with a small R parameter. In all our 
results we consistently use MSTW2008 NNLO PDFs ll262l . 

In this writeup we will explore fixed NNLO and resummed NNLL-i-NNLO predictions for H+ 
0-jet cross sections and compare various methods for evaluating the uncertainty as a function of cuts on 
Pji and Tcm.- The three methods we will discuss for evaluating the uncertainties in exclusive jet cross 
sections are 

A) "Direct Exclusive Scale Variation". Here the uncertainties are evaluated by directly varying the 
renormalization and factorization scales in the fixed-order predictions for each exclusive jet cross 
section a^- This implies that the uncertainties are 100% correlated for different A^s. 

B) "Combined Inclusive Scale Variation", as proposed in Ref. [,26.1 and utilized in Refs. II363[ 13641 



I365II . Here, the perturbative uncertainties in the inclusive N-]ei cross sections, a>N, are treated 
as the primary uncertainties that can be evaluated by scale variations in fixed-order perturbation 
theory. These uncertainties are treated as uncorrelated for different N. The exclusive A^-jet cross 
sections are obtained using a^ = cr>N — cy>N+i- The uncertainties and correlations follow 
from standard eiTor propagation, including the appropriate anticoiTclations between a^ and aN±i 
related to the division into jet bins. 
C) "Uncertainties from Resummation." Resummed calculations for exclusive jet cross sections can 
provide uncertainty estimates that allow one to simultaneously include both types of correlated 
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and anticorrelated uncertainties as in methods A and B. The magnitude of the uncertainties may 

also be reduced from the resummation of large logarithms. 
In all three methods, adding the exclusive jet cross sections yields the expected scale variation in the 
total cross section. Method B avoids a potential underestimate of the uncertainties in individual jet bins 
due to strong cancellations that can potentially take place in method A. Method B produces realistic 
perturbative uncertainties for exclusive jet cross sections when using fixed-order predictions for various 
processes, since it accounts for the presence of large logarithms at higher orders caused by the jet binning. 
In Method C one utilizes higher-order resummed predictions for the exclusive jet cross sections, which 
allow one to obtain improved central values and further refined uncertainty estimates. 



The basic structure of the large logarithms in the perturbative series is discussed in Sec. 12.2 In 



Sec. 12.3 we discuss and compare the above three methods to determine the perturbative uncertainties. 
The work discussed here regarding methods A, B, and C builds on work done in Refs. 0751 l26l . was 
initiated at Les Houches, and has also been incorporated in the second Higgs Yellow Book report II376II 



(Sees. 5.2 and 5.5.) We also review recent work by others that can be found in B76ll (Sec. 5.3). 

Note that here we are only discussing the theoretical uncertainties due to unknown higher-order 
perturbative corrections, which are commonly estimated using scale variation. Parametric uncertainties, 
such as PDF choices and as{mz) uncertainties, must be treated appropriately as common sources for all 
investigated channels. 

12.2 Theoretical Motivation 

12.21 Structure of the Perturbative Series 

We begin by discussing the structure of the large logarithms in exclusive jet cross sections. For Higgs 
production from gluon fusion with pj, < p^^ the leading double logarithms appearing at 0{as) are 

ao{pr) = ^B(l-—2ln'?^ + ■••), (85) 

V vr niH ' 

where as is the Born (tree-level) cross section. 

The total cross section only depends on the hard scale Q = mu, which means by choosing the 
factorization and renormalization scales ^f ~ j^l^ ~ mn, the fixed-order expansion does not contain 
large logarithms and has the structure 

^totai ~ as [1 + a, + a^ + 0{al)] . (86) 

Our expressions for perturbative series such as this one are schematic, showing the scaling of the terms 
without the coefficient functions. The convolution with the parton distribution functions (PDFs) are also 
not displayed. For gg — )• H, the coefficients of this series can be large, corresponding to the well-known 
large K factors. As usual, varying the scale in Qs(/i) (and the PDFs) one obtains an estimate of the size 
of the missing higher-order terms in this series, which we denote by Z^totai- 
The inclusive 1-jet cross section has the perturbative structure 

fT>i(/"*) ^ aB [as{L^ + L + l) + al{L^ + L^ + L^ + L + l) + 0{alL^)\ , (87) 

where the logarithms L = ln{p'^^^ /uih)- For p^'^^ <^ mu these logarithms can get large enough to 
overcome the q^ suppression. In the limit OgL^ ~ 1, the fixed-order perturbative expansion breaks down 
and the logarithmic terms must be resummed to all orders in a^ to obtain a meaningful result. For typical 
experimental values of p^"' fixed-order perturbation theory can still be considered, but the logarithms 
cause large corrections at each order and dominate the series. 



The exclusive 0-jet cross section is equal to the difference between Eqs. ( [861 ) and ( [87] ), and so has 
the schematic structure 
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~ (7^1 [l + as + a'i + 0(af )] - [as{L'^+ L + 1) + a'i{L^+ L^+ L'^+L + 1) + 0{a^,L^)] } 

(88) 

In this difference, the large positive corrections in citotai partly cancel against the large negative logarith- 
mic corrections in a>i. For example, at 0{as) there is a value of L for which the as terms in Eq. ( [88] ) 
cancel exactly. At this p"^^^ the NLO 0-jet cross section has vanishing scale dependence and is equal 
to the LO cross section, ao{p'^'^^) = as- Due to this cancellation, a standard use of scale variation in 
cro(p'^"*) does not actually probe the size of the large logarithms, and does not provide an estimate of 
Z\cut- This issue impacts the uncertainties in the experimentally relevant region for p'^^^. 

For example, for gg ^ H (with ^/s = 7TeV, mn = 165 GeV, jdf = fXr = mn/^), one 
finds II2T41 13711 I901I2T611 

cTtotai = (3.32 pb) [1 + 9.5 as + 35 a^ + 0{al)] , 
(7>i{l^T >30GeV, |V^*| <3.0) = (3.32 pb) [4.7a, + 26 a^ + ©(a^)] . (89) 

In CTtotai one can see the impact of the well-known large K factors. (Using instead fif = Hr = mn 
the 9. Sag and 35a^ coefficients in cxtotai increase to lla, and 65a'^.) In cr>i, one can see the impact of 
the large logarithms on the perturbative series. Taking their difference to get (Tq, one observes a sizeable 
numerical cancellation between the two series at each order in a^- 

12.22 Perturbative Series for the Event Fraction 

Experimentally the desired quantity which incorporates the jet- veto cut is the exclusive 0-jet event frac- 
tion 

'T'total O"total 

One option for treating fo{p^^^) is to consider it as a derived quantity, given the basic observables 
Wo, o"totai} or {(T>i, cjtotai}- In this approach, which was utilized in Ref. l|26l and Ref. ll376i (Secs. 5.2 
and 5.5), one propagates the uncertainties from the crjS to derive those for /o(p^'^*)- This approach is 
natural from the perspective of utilizing log-resummed computations for ao{p™^). In particular, it main- 
tains the constraint that for large p'^"* we have monotonic convergence of ctq —5- Ctotai and /o —5- 1, a 
property that relies on a phase space cut reducing the cross section, but does not depend on perturbation 
theory. 

When using fixed-order predictions for the various cross sections, an alternative to Eq. ( |90| con- 
sidered in Ref. [376|(Sec. 5.3) is to analyze the perturbation theory for /o(p'^^*) directly. In this case 
different schemes of organizing the perturbation series, by keeping or dropping various 0{a^) terms, 
give a method to estimate the size of the higher-order perturbative corrections. Three such schemes were 
considered in Ref. ||376l (Sec. 5.3) (which we label here by schemes 1,2,3). It is convenient to define the 
perturbative corrections to the cross section by dividing each of them by the Born cross section as, such 
that we can write 

CTtotai = ^b[1 + aSai + ^[^1 + 0(«2)] , 

^>i(p^"*) = 'JB [4'J(P^"*) + <^g(/"*) + 0(a3)] . (91) 

With this notation the result of treating /o as a derived quantity is 

. n f scheme 1) ^>} (p'""* ) + ^^1 (p'"* ) 

1 + ^total + "^total 
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while at the same order in perturbation theory we can also consider the following expressions for /q: 

[/o(p-*)] (^^^'^"^^ ^) = 1 - [ag(p^") + 4'i(^^")] + 4'? ^Sai + ^(«') • (93) 



We will contrast using the expressions in Eq. ( [92] ) and Eq. ( [931 ) with various methods for analyzing the 
uncertainty in our discussion below. 

12.3 Uncertainty Analysis for Exclusive Jet Bins 



As described in Sec. 12.21 the phase space restriction defining ctq changes its perturbative structure 



compared to that of o"totai- In general this gives rise to an additional perturbative uncertainty due to 
missing higher-order terms depending on p'^"*. We will call the associated jet-binning uncertainty Z\cut- 
This can be thought of as an uncertainty related to the presence of large logarithms of p'^"* at higher 
orders in perturbation theory. In Eq. ([83[) both ao and o">i depend on the phase space cut, p'^"*, and by 



construction this dependence cancels in ctq + (T>i. Hence, the additional uncertainty Z\cut induced by 
^cut jjjygf ]jQ 100% anticorrelated between o"o(p'^"*) and a>i{p^^^), such that it cancels in their sum. For 
example, using a covariance matrix to model the uncertainties and correlations, the contribution of Z\cut 
to the covariance matrix for {ctq, cr>i} must be of the form 

/ a2 _a2 \ 

"'"*= X" aH- •"" 

\ ^cut ^cut / 

The questions then are: (1) How can we estimate Z\cut in a simple way, and (2) how is the perturbative 
uncertainty Z\totai of <7totai related to the uncertainties of (Tq and (T>i? 

12.31 Perturbative Uncertainties for Method A 

When using method A to estimate the perturbative uncertainties one simply uses a common scale varia- 
tion to estimate the uncertainty Aq in ao and the uncertainty Z\>i in a>i. By doing so the uncertainties 
are 100% correlated, corresponding to a covariance matrix in method A for {o"o, (T>i} given by 



Ca= ' . ^M . (95) 




Here Z\totai = -^o + ^>i is the scale uncertainty in (Ttotai- When instead of o"o we directly calculate the 
0-jet event fraction /o using Eq. ( [92] ) or one of the expressions in Eq. ( [93] ), we can again determine the 
method A uncertainty estimate by scale variation in /o (we will refer to these results as methods Ai, A2, 
and A^ respectively). 

In this method Z\cut is not included because, as explained below Eq. ( [88] ), varying the perturbative 
scale in Aq does not probe the presence of the higher order lai^ge logarithms depending on p'^"*. This 
method can lead to an underestimate of the perturbative uncertainty in ao (and hence /o), since there is a 
region of p'^"* values where scale variation is no longer a reasonable estimate of higher order corrections 
because of the vanishing of the fi dependence. 

12.32 Perturbative Uncertainties for Method B 

Since the perturbative series for a>i in Eq. ( [87] ) is dominated by the large logarithms of p'^"*, we can 
use its scale variation Z\>i to get an estimate for their size by taking Z\cut = A>i |[26l . Since Z\cut and 
^totai are by definition uncorrelated, by setting Z\cut = A>i we are effectively treating the perturbative 
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series for (Ttotai and a>i as independent with uncorrelated perturbative uncertainties. That is, considering 
{ctotaii o'>i}, the covariance matrix is diagonal, 



^total 

, 




(96) 



where Z\totai and Z\>i are evaluated by separate scale variations in the fixed-order predictions for cJtotai 
and o">i. This is consistent, since for small p'^"* the two series have very different structures. In particular, 
there is no reason to believe that the same cancellations in ctq will persist at every order in perturbation 
theory at a given p'^"*. It follows that the perturbative uncertainty in (Tq = o"totai — cr>i is given by 
^totai + '^>i' ^^'^ ^^^ resulting covariance matrix for {ao,a>i} in method B is 

Cb= f4l+ Altai -4A . (97) 

Note that all of /\totai occurs in the uncertainty for cjo. This is reasonable from the point of view that ctq 
starts at the same order in as as o"totai and contains the same leading virtual corrections. The method B 
uncertainty for the event fraction /o follows most naturally by error propagation from the cross sections, 
treating it as a derived quantity. 



The limit Z\cut = ^>i that Eq. ( |97| ) is based on is of course an approximation. However, the 
preceding arguments show that it is a more reasonable starting point than method A, since the latter does 
not account for the additional p'^^^ induced uncertainties. 

The generalization of the above discussion to more jets and several jet bins is straightforward. 
For the A'^-jet bin we replace cxtotai — ^ 0">Af> ctq — ^ (^N^ and a>i — ;• cr>Af+i. If the perturbative series 
for (T>iv exhibits large a^ corrections due to its logarithmic series or otherwise, then the presence of 
a different series of large logarithms in (j>Ar+i will again lead to cancellations when we consider the 
difference a]\j = a>N — c>Af+i- These two cross sections will have different series for their double 
logarithms since the number of active partons and their color structure differ. In this situation A^n^i 
will again give a better estimate for the extra Z\cut type uncertainty that arises from separating a>N into 
crivando->Ar+i. 

12.33 Perturbative Uncertainties for Method C 

In method C we assess the perturbative uncertainties using resummed predictions for variables p'^^^ that 
implement a jet veto, following Refs. f375' '261. An advantage of using resummed predictions is that 
they contain perturbation theory scale parameters which allow for an evaluation of two components of 
the theory error, one which is 100% correlated with the total cross section (as in method A), and one 
related to the presence of the jet-bin cut which is anti-correlated between neighboring jet bins (as in 
method B). 

The resummed H + 0-jet cross section predictions of Ref. II375I follow from a factorization the- 
orem for the 0-jet cross section II374II . o"o(7^™*) = HXgiZgj (g) Sfifj, where H contains hard virtual 
effects, the Is and S describe the veto-restricted collinear and soft radiation, and the /s are standard par- 
ton distributions. Fixed-order perturbation theory is carried out at three scales, a hard scale /i|^ ~ m'jj 
in H, and beam and soft scales fi'^ r^ ''t^hT^^ and /i| ~ (7^m*)^ for ^ and S, and are then connected 
by NNLL renormalization group evolution that sums the jet- veto logarithms, which are encoded in ratios 
of these scales. The perturbative uncertainties can be assessed by considering two sources: i) an overall 
scale variation that simultaneously varies {//// , ^u^ , Ai^} up and down by a factor of two which we denote 
by Ahq, and ii) individual variations of ^b or jjls that each hold the other two scales fixed [375 1, whose 
envelope we denote by the uncertainty Asb- Here Ahq is dominated by the same sources of uncertainty 
as the total cross section cJtotai. and hence should be considered 100% correlated with its uncertainty 
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Fig. 31: Relative uncertainties for the 0-jet bin cross section from resummation at NNLL+NNLO for 
beam thrust Tcm on the left and p^^ on the right. 



^totai- The uncertainty Asb is only present due to the jet-bin cut, and hence gives the Z\cut uncertainty 
that is anti-correlated between neighboring jet bins. 

If we simultaneously consider the cross sections {o"o,(T>i} then the full correlation matrix in 
method C is 



Cc 



-/\2 



-^Ib 



+ 



AhoAh>i 



AhgAh>i 



(98) 



where Z\h>i = /\totai — ^Ho encodes the 100% correlated component of the uncertainty for the (> 1)- 
jet inclusive cross section. Computing the uncertainty in o"totai gives back Z\totai- 



Eq. ( |98| ) can be compared to Ca for method A in Eq. ( [95| ), which corresponds to taking Asb — ^ 
and obtaining the analog of Aho by up/down scale variation without resummation {^h = l^B = Ms)- 
It can also be compared to Cb for method B in Eq. (97 1, which con^esponds to taking Asb — ^ A>i 
and Ah>i — > 0, such that Abo — )■ /^totai- The numerical dominance of /^|^ over AhqAh>i in 
the 0-jet region is another way to justify the preference for using method B when only given a choice 



between methods A and B. For example, for p^* 

AmAH>i = 0.02. 



30 GeV and jr/J'^*! < 5.0 we have A%„ = 0.17 and 



In Fig. 31 we show the uncertainties Asb (light green) and Ahq (medium blue) as a function of 
the jet-veto variable, as well as the combined uncertainty adding these components in quadrature (dark 
orange). From the figure we see that the Aho dominates at large values where the veto is turned off 
and we approach the total cross section, and that the jet-cut uncertainty Asb dominates for the small cut 
values that are typical of experimental analyses with Higgs jet bins. The same pattern is observed in the 
left panel which directly uses the NNLL-i-NNLO predictions for 7^'^^'' ^^^^ the right panel which shows 



the result from reweighting these predictions to p^ as explained in Sec. 12.34 below. 



12.34 Comparison of Uncertainty Methods 



In Fig. 32 we compare the uncertainties for the 0-jet bin cross section from methods A (medium green), 
B (light green), and C (dark orange). In the upper panels we use 7^^* as the jet-veto variable and full 
results for the NNLO and NNLL-i-NNLO cross sections, while in the lower panels we use p^^^ as the 
jet-veto variable with the full NNLO and the reweighted NNLL-i-NNLO results (as explained below). 
The upper panels use a cut on beam thrust, 7^*^* while the lower panels use p^^"*. The right panels show 
the same results as those on the left, but are normalized to the highest-order result to better show the 
relative differences and uncertainties. The uncertainties in methods A, B, and C are computed from the 
upper left entry of the matrices Ca, Cb, and Cc, respectively. 
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Fig. 32: Comparison of uncertainties for methods A, B, C for the 0-jet bin cross section for beam thrust 
T^m (top) and p^^ (bottom). Results are shown at NNLO with uncertainties from methods A and B and 
for the NNLL+NNLO resummed result using method C (reweighted for p^*). On the right all curves are 
normalized relative to the NNLL+NNLO central value. 



From Fig. 32 we see that in method A (medium green bands) for small values of p^* the can- 
cellations that take place in a^ij^^^) cause the error bands to shrink and eventually almost vanish at 
p™* ~ 25 GeV, where there is an almost exact cancellation between the two series in Eq. ( (88| ). This is 
avoided by using method B (light green bands). For large values of p^* method B reproduces the method 
A scale variation, since a->\(jf^^^ becomes small. On the other hand, for small values of p^* the uncer- 
tainties estimated using method B are more realistic, because they explicitly estimate the uncertainties 
due to the presence of higher order large logarithmic corrections. 

The features of this plot are quite generic. In particular, the same pattern of uncertainties is ob- 
served for the Tevatron, when using [i = niH as our central scale (with ^ = 2m// and /i = m///2 for the 
range of scale variation), whether or not we only look at jets at central rapidities, or when considering 
the exclusive 1-jet cross section. We also note that using independent variations for /xj and /u^ does not 
change this picture, in particular the ^j variation for fixed ^r is quite small. 

For method C with Tcm we make use of resummed predictions f or // + jets from gluon fusion 
at next-to-next-to-leading logarithmic order (NNLL+NNLO) from Ref . [.375.1 . This includes the correct 
NNLO fixed-order corrections for <to(7^^*) for any cut. The resulting cross section <to(7^™*) has the jet 
veto implemented by a cut Taa < T^m*- This cross section contains a resummation of large logarithms 
at two orders beyond standard LL parton shower programs. A similar resummation for the case of p^^ 
is not available. Instead, we use MC@NLO and reweight it to the resummed predictions in Tcm, doing 
so for both the central curve as well as each of the six scale variation curves needed for the uncertainty 
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Fig. 33: In the left panel we show the same three curves as in the bottom-left panel of Fig. 32 but for the 



event fraction fo{p'^^) treated as a derived quantity from the jet-bin cross sections. In the right panel we 



contrast the uncertainties obtained using Eqs. ( 92 1 and ( 93 1 together with method A, with the uncertainty 
obtained using method B. 



determination in method C^jWe then use the reweighted Monte Carlo sample to obtain cross section 
predictions for the standard jet veto, o"o(p™*). We will refer to this as the reweighted NNLL-i-NNLO 
result. Since the Monte Carlo here is only used to provide a transfer matrix between Tcm and pj, , 
and both variables implement a jet veto, one expects that most of the improvements from the higher- 
order resummation are preserved by the reweighting. However, we caution that this is not equivalent 
to a complete NNLL-i-NNLO result for the p™* spectrum, since the reweighting may not fully capture 
effects associated with the choice of jet algorithm and other effects that enter at this order for p^^. The 
dependence on the Monte Carlo transfer matrix also introduces an additional uncertainty, which should 
be studied and is not included in our numerical results. The transfer matrix is obtained at the parton 
level, without hadronization or underlying event, since we are reweighting a partonic NNLL-i-NNLO 
calculation. 



From Fig. 32 one observes that the resummation of the large jet- veto logarithms (dark red central 



curve) lowers the cross section for both 7^*^ and p^ . Comparing to NNLO for cut values > 25 GeV 
the relative uncertainties in the resummed result of method C (dark orange bands) and the reduction 
in the resummed central value are similar for both jet-veto variables. Since one expects resummation 
to decrease the uncertainties, one can also see that the NNLO uncertainties from method B are more 
consistent with the higher order NNLL-i-NNLO resummed method C results than those in method A. We 
observe that the uncertainties in method C are reduced by about a factor of two compared to those in 
method B. Since the zero-jet bin plays a crucial role in the H — ;■ WW channel for Higgs searches, and 
these improvements will also be reflected in uncertainties for the one -jet bin, the improved theoretical 
precision obtained with method C has the potential to be quite important. 

we show results for the 0-jet event fraction /o, with p™* as the jet-veto variable. In the 



In Fig. 
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left panel we compare the uncertainties in fo{p^^) that result from propagating the uncertainties from the 
jet-bin cross sections obtained from methods A (medium green), B (light green), and C (dark orange). 
The conclusions are analogous to the corresponding cross-section results in the bottom-left panel of 
Fig. [32j namely that method B provides a better estimate for the perturbative fixed-order uncertainties 
than method A, and that the higher-order logarithmic summation present in method C leads to a slightly 
smaller central value together with the decrease to the uncertainty one expects from incorporating the 



resummation. In the right panel of Fig. 33 we show the results of the different perturbative schemes for 



/o defined in Eq. (92 1 (middle dark green band) and Eq. (93 1 (lower narrow blue band and upper wide 



^''We thank Fabian Stockli for collaboration on this NNLL+NNLO reweighting analysis forp^^. 
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method A 


method B 


method C 


Saoip^'') 


3% 


19% 


9% 


6(7>l{p^^') 


19% 


19% 


14% 


^(o'totahO'o) 


1 


0.78 


0.15 


P(<7total,0">l) 


1 





0.65 


p{ao,a>i) 


1 


-0.63 


-0.65 


5fo{p'T^') 


6% 


13% 


9% 


5f>i{pT) 


10% 


21% 


11% 


P(o"total, /o) 


-1 


0.43 


-0.38 


P(<7totah/>l) 


1 


-0.43 


0.38 



Table 9: Example of relative uncertainties 5 and correlations p obtained for the LHC at 7 TeV for p^"* = 

30GeVand|ryJ^*| < 5.0. 

yellow band) each at NNLO and in each case obtaining the uncertainties using method A (direct scale 
variation) ll376 1(Sec. 5.3). For comparison, the middle light green band shows the uncertainties obtained 
from method B. The different method A schemes have a wide spread, which demonstrates the large size 
of the higher-order perturbative corrections in the total and inclusive 1-jet cross sections. The central 
values of the alternative methods A2 and A^ are not covered by the method Ai uncertainty band, but 
all three central curves are covered by the larger uncertainty band from method B (except at small p^"* 
where scheme 3 starts to diverge earlier than the other schemes). This can be taken as a confirmation that 
method A tends to underestimate the perturbative uncertainties in the fixed-order results li376i] (Sec. 5.3), 
while method B produces more realistic fixed-order uncertainties. 

To appreciate the effects of the different methods on the correlation matrix we consider as an 
example the results for p^* = 30 GeV and 1 77^''* | < 5.0. The inclusive cross sections are Utotai = (8.76 ± 
0.80) pb at NNLO, and o">i = (3.31 it 0.64) pb at NLO. The relative uncertainties and correlations at 
these cuts for the three methods are shown in Table |9] The numbers for the cross sections are also 
translated into the equivalent results for the event fractions, /o(p™*) = cro(p™*)/<7totai and f>i{p'^^) = 
(^>i{p't^) / (^ total- Note that method A should not be used due to the lack of a contribution corresponding 
to Z\cut in this method, and the resulting underestimated 5aQ. In methods B and C we see, as expected, 
that (To and (T>i have a substantial anti-correlation due to the jet-bin boundary they share. 

12.4 Conclusion 

To summarize, we have discussed the implications of separating LHC cross sections into jet bins, using 
Higgs production from gluon fusion as a concrete example. The jet binning induces logarithmic depen- 
dences on the jet-bin boundary which is important to properly take into account when making predictions 
and estimating perturbative uncertainties. When using fixed-order predictions only, the additional log- 
arithms at higher orders in perturbation theory caused by the jet binning can be taken into account in 
the perturbative uncertainty estimate using method B. By resumming the jet-binning logarithms one can 
obtain improved predictions with reduced (and more sophisticated) uncertainties using method C. 

Here we have focused our discussion on gq and cr>i and how to take into account the resulting 
jet-bin boundary. To further separate cr>i into a one-jet bin cti and a a>2 one can use method B for this 
boundary by treating Z\>2 as uncorrelated with the total uncertainty for cr>i from either methods B or 
C. Examples of utilizing method B for this jet bin boundary can be found in Ref. II26I . Once it becomes 
available one can also use a resummed prediction with uncertainties for this boundary with method C. 
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13. A NLO BENCHMARK COMPARISON FOR INCLUSIVE JET PRODUCTION AT 
HADRON COLLIDERS E3 

Abstract 

We present a benchmark comparison of two next-to-leading order (NLO) cal- 
culations for inclusive jet and jet pair production at hadron colliders. A new 
version of the NLO code EKS is adapted for computation of differential cross 
sections and compared to an independent calculation based on the FastNLO 
code. A percent-level agreement between the two codes is observed for spec- 
ified settings of computations at typical transverse momenta and rapidities of 
Tevatron and LHC measurements. We identify theoretical prerequisites for 
achieving such level of agreement and comment on the stability of NLO cal- 
culations with respect to the factorization scale choice. 

13.1 INTRODUCTION 

Inclusive jet production at hadron colliders provides an excellent opportunity to test perturbative QCD 
(PQCD) and look for possible new physics beyond the Standard Model (SM) over a wide range of 
energy scales. Single inclusive jet production in the Tevatron Run-2 has been recently used to determine 
the QCD coupling constant 0771 and constrain parton distribution functions (PDF) in the proton in 
global QCD analyses by several groups ||255[ 13781 1262[ [3Tll. Jet production data provide constraints 
on the gluon PDF at large x values, possibly in a combination with small- a; quark PDFs, as discussed 
in Section [T] Invariant mass distributions of dijets B79i l. angular distributions 113801 13811 . and other 
jet observables at the LHC 113821 12931 1292] provide a unprecedented opportunity to extend searches for 
quark compositeness and new particle resonances toward the highest energies attainable. 

In this contribution, we examine agreement between the computer programs that are available for 
NLO calculations of jet production cross sections. NLO QCD predictions for jet production work re- 
markably well in a wide kinematical range and across many orders of magnitude of the cross sections. 
Nonetheless, the latest PDF analysis evaluates many scattering processes up to NNLO in perturbative 
QCD. Jet production observables are pivotal for constraining the laige-x gluon PDF, but remain known 
to NLO only. We identify and document main factors affecting NLO jet cross sections at a few-percent 
level of accuracy and compare the numerical results for typical collider kinematics. Differences between 
the programs used, and choices for the theoretical inputs made, may be responsible for some differences 
observed between CTIO and other PDFs, as explained below. Such NLO benchmark comparison will be 
useful for quantifying or reducing the uncertainties on the resulting PDFs and for the future implemen- 
tation of NNLO and higher-order resummed contributions to the jet cross sections. 

From the experimental point of view, jet production has an advantage of very high statistics and 
a drawback of sizeable systematical errors associated with complexities of jet reconstruction. NLO 
theoretical uncertainties due to the QCD scale dependence and the fixed-order model for the jet algorithm 
are comparable to the experimental errors. Control of numerical accuracy involves, in particular, careful 
tuning of Monte-Carlo integration to handle steeply falling jet cross sections. 

An early numerical code (EKS) for the NLO calculation of single-inclusive jet and dijet distribu- 
tions was developed by S. D. Ellis, Z. Kunszt and D. E. Soper in 1990's 1383] based on the subtraction 
method. Two other widely used numerical programs are NLOJET-H-i- B84. 3851 and FastNLO 1290, 291il . 



^^Contributed by: J. Gao, Z. Liang, P. M. Nadolsky 
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The latter provides a fast interface to obtain NLO predictions in kinematical bins of already published 
experimental jet cross sections by interpolating table files produced by NLOJET++. Besides these fixed- 
order calculations, POWHEG combines the NLO jet production cross sections with leading-logarithm 
QCD showering effects II386II . Some phenomenological studies also include partial NNLO contributions 
to jet cross sections obtained by threshold resummation ||387i 

The agreement between the above NLO numerical programs is not automatically met, which mo- 
tivates the present benchmark comparison. The past CTEQ PDF analyses computed NLO jet cross 
sections using NLO K-factor tables produced by the EKS code, while other PDF analysis groups use 
FastNLO. Since the CTIO NLO gluon PDF behaves somewhat differently at large x than the gluon PDF 
from MSTW'08 or other groups II2621| . one must compare the EKS and FastNLO computations for the 
same input values to confirm that these programs do not cause the observed disagreement. 

Here we show that the results for the Tevatron {^/s = 1.96 TeV) and LHC (y^ = 7 TeV) from 
EKS and FastNLO agree well when the computation parameters are chosen as described in the next 
section. These settings must be consciously controlled in order to reach acceptable agreement. As a 
result of this work, the EKS code has been revised to improve its stabihty and efficiency and to implement 
output into new differential cross sections 1,388,1 . 

13.2 Theoretical setup and inputs 

Several theoretical inputs must be matched exactly between the EKS and FastNLO programs in order to 
reach the level of agreement shown in the figures below. 

• Jet algorithm. When calculating the distribution of jet observables, we need to use the same 
jet algorithms as the ones in the experimental measurements. In this comparison, we utilized 
the cone-based Midpoint algorithm [389| for the Tevatron observables and cluster-based anti-^T- 
algorithm 1134 lH for the LHC. The only difference between the Midpoint algorithm and modified 
Snowmass algorithm II389II used in the original EKS program is that the Midpoint algorithm always 
starts with the middle point between the two partons' directions as a seed for a new protojet, no 
matter how large their separation is. In the NLO theoretical calculations for single-jet or dijet 
production that include at most three final-state partons, the cluster-based kx Ii390il , anti-ZcT, and 
Cambridge-Aachen (CA) 1139 111 algorithms are equivalent. 

• The recombination scheme is a procedure for merging two nearby partons into one jet. For 
example, the energy scheme (4D, based on adding the 4-momentum) or Et scheme (based on 
adding the scalar E-r, then averaging over the partons' directions using Et as the weights) can 
be employed to find the momentum of the merged jet II392I . Our comparison uses the energy 
scheme for both the Tevatron and LHC measurements, as it is often used by the recent experiments. 
Different choices of the recombination scheme can cause differences of up to ten percent in the 
NLO distributions, as will be shown later. Note that, with the energy scheme, the jet could be 
massive, which means that the jet's pseudorapidity will not be equal to its rapidity. 

• The jet trigger imposes acceptance conditions on each jet's px or rapidity when deciding if this 
jet's contribution is included into the jet observable. In NLO calculations of single-inclusive jet 
distributions, the jet trigger conditions have no influence. In dijet production, they may change the 
cross sections by small amounts by affecting the selection of two leading jets in some cases. In our 
dijet calculations we choose pT > 40 GeV, \y\ < 3 for each jet at the Tevatron and pT > 30 GeV, 
\y\ < 3 at the LHC. 

• Renormalization and factorization scales. The scale choice is only related to theory and has no 
correspondence in experiment. It is conventional to choose the renormalization and factorization 
scales to be of order of the typical transverse momentum pT of the jet(s): ^r ^ ^p ^ Pt- 
In contributions with two resolved jets, pT naturally corresponds to the transverse momentum 
of either of the final-state jets (which are equal by momentum conservation). More ambiguity 
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is present in contributions with three resolved jets, when pT can correspond to the transverse 
momentum of either of the jets in each event or to a combination of three transverse momenta. 
A meaningful comparison must use equivalent definitions of "jet px " in the renormalization and 
factorization scales of both NLO calculations. 

When FastNLO interpolates tables of NLOJET++ cross sections for single inclusive-jet produc- 
tion, it sets fifj and jj^p proportional to the pT value at a fixed point in each px bin of the experi- 
mental data. Given the high precision of the latest PDF analyses, the FastNLO scale convention 
produces a numerically different result than the scale proportional to the px of the leading jet or 
the average pr of two leading jets in each event. It depends on the binning of the experimental 
data and is numerically close to the average pT in each bin for small enough bins. 
In the EKS calculations for single-jet production, we set the scale proportional to pT of each 
individual jet in any px bin, which means that we repeat the evaluation of the matrix elements with 
three resolved jets (contributing to three pT bins) by successively setting fiR^p to be proportional 
to the pt of each jet in the event. Such matrix elements are thus evaluated three times. This 
event-level scale setting of EKS turns out to be numerically close to the bin-level scale setting of 
FastNLO if the bin sizes are small. However, a few-percent differences are still observed at the 
largest rapidities and pT- For dijet production, FastNLO and EKS choose the //r and ^p scales 
that are proportional to the average \pt\ = {\pti\ + |pt2|)/2 of the two leading jets. 
• Monte-Carlo integration. Precision calculations for jet production are numerically challenging 
because of the rapid falloff of the cross sections with the jet's px and rapidity, and also because of 
large numerical cancellations occurring between some 2 — )^ 2 and 2 — ;■ 3 contributions. Both EKS 
and NLOJET-i-i- evaluate differential cross sections by Monte-Carlo integration, which requires to 
generate of order 10^ of sample points to achieve percent-level accuracy for the whole kinematical 
region. The upgraded EKS code performs the Monte-Carlo integration using the VEGAS method 
from the CUBA2.1 library II393II . The EKS output is produced in the form of two-dimensional 
cross sections {d? a / {dpT dy) , d'^a/{dMjj dy), ...) and stored in finely binned two-dimensional 
histograms. Such output is "almost fully differential" in the sense that the finely grained histograms 
can be rebinned into any set of coarse bins of the given experiment at the stage of the user's final 
analysis. This format is different from the FastNLO format, which provides the cross sections in 
coarse bins taken from pre-existing experimental publications. 

The fine binning in EKS is introduced at the stage of Monte-Carlo integration in order to improve 
convergence and to better handle the NLO cancellations. The Monte Carlo sampling pattern is 
tuned automatically to ensure that all fine bins are filled with comparable numbers of sample 
points, regardless of the momentum and scattering angle values associated with each bin. Then 
we get uniform relative errors on the cross sections in all bins without consuming too much CPU 
time, and despite the dramatic variation of cross sections across the bins. Finally, EKS includes 
a module to allow for flexible choices of scales /x/j and ^p, and another module for calculating 
differential cross sections of user-provided jet observables. 

13.3 RESULTS 



Figs. 34 39 compare our representative numerical results with the ones provided by FastNLO for pT 
distributions of single jets, invariant mass distributions of dijets, and (in the case of DO Run-2) angular 
distributions (x) of dijets. Kinematical bins of the Tevatron {y/s = 1.96 TeV) ll287l l285l l289l [3941 and 
LHC (^ = 7 TeV) ||293ll292]| measurements, and CTEQ6.6 PDFs ||256l were used. The cone sizes R 
of the jets are indicated in the figures. 

Left panels in the figures show ratios of EKS to FastNLO cross sections, UEKs/oTastNLO^ at the 
LO (red points) and NLO=LO-i-NLO correction (blue points), in kinematical bins provided by the exper- 
iments. The horizontal axis indicates the ID of each bin, which are arranged in the order of increasing 
jet rapidity y and then jet's pT for inclusive jet production, y and then Mjj for dijet production, and Mjj 
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then X for dijet angular dependence. Vertical lines indicate the boundaries of each rapidity interval for 



single-jet and dijet distributions, and of each dimass interval for the x distribution. For example, Fig. 34 
shows (TEKs/o'FastNLO in 6 bins of jet rapidity, with bins 1...23 corresponding to the first rapidity bin 
{\y\ < 0.4), bins 24.. .45 corresponding to the second rapidity bin (0.4 < \y\ < 0.8), and so on. 

The left panel includes, from top to bottom, three plots obtained with the renormalization and 
factorization scales equal to 1/2, 1, and 2 times the center scale. We can see a good overall agreement 
between EKS and FastNLO both at LO and NLO. The only significant discrepancies are found in the 
highest pt bins for both the Tevatron and LHC single inclusive jet production, which may be due to 
the difference in the scale choices used in EKS and FastNLO. [These differences reduce when going to 
NLO]. In the EKS single-jet calculation, we use the actual px of the partonic jet filled into the bin as the 
scale input. FastNLO sets the scale according to a fixed px value in each experimental bin, which tends 
to be different from the EKS scale in the highest px bins, which have large widths. The same reason 
causes a small normalization shift in the other pT bins. 

For dijet production, we only observe random fluctuations at highest Mjj that are mainly due to 
numerical integration errors. 



In the right panels of Figs. 34 39 we present plots of the NLO K factor from EKS for each 
distribution, defined as the ratio of the NLO differential cross section to the LO one. The value of the K 
factor and its stability with respect to the scale choice may provide an indication of the magnitude of yet 
higher-order corrections. 

To minimize the potential effect of higher-order terms, one might opt to choose the renormalization 
and factorization scales that bring the K factor close to unity in most of the kinematical region. An 
alternative approach for setting the scale is based on the minimal sensitivity method, which suggests to 
choose the fiR and ^p values (taken to be equal and designated as fi in the following) at the point where 
the scale dependence of the NLO cross section is the smallest. 

In (di)jet production at central rapidities at the Tevatron, both requirements (K sa 1 
and da]\jLo{fJ')/dfJ. ~ 0) could be satisfied by choosing /i ss O.Bpr', see, e.g., the appendix in Ref. [395 1. 
For this reason, the scale pt/2 was used in the CTIO study. However, the point of the minimal sensitivity 
shifts to higher values (close to px or even higher) at forward rapidities at the Tevatron or at all rapidities 
at the LHC. For such higher scales, however, it is hard to satisfy the requirement that K remains close to 
unity at the same time. 

This point is illustrated by our plots of the K factors. At the central rapidities and /x/j = jip = 



0.5pT at the Tevatron (the lowest 3 rapidity bins in Figs. 34 31 ), K ^ 1 and is relatively independent 
of Pt, as seen in the top subpanels. However, with this scale choice the K factor deviates significantly 
from unity and has strong kinematic dependence if the rapidity and px are large. If one chooses the scale 
that is equal to px or even 2px (the middle and bottom figures), in accord with the minimal sensitivity 
method for the forward bins, the kinematical dependence of the K factor reduces, but its value increases 
to 1.3-1.6 in most of the bins. 



For CMS kinematics (Figs. 38p9 l, the K factor has significant kinematical dependence for all 



central scale choices, however, the choice fiR = fJ,F = Pt (the middle subpanels) results in a compar- 
atively flatter K factor that is also closer to unity. We can see that it is hard to find a fixed scale (or a 
scale of the type px x (a function of y) II383II ) that would simultaneously reduce the magnitude of the 
NLO correction and stabihze its scale dependence and kinematical dependence. The scale 0.5px may be 
slightly more optimal at the Tevatron, and the scale px may be slightly better at the LHC. In the absence 
of a clearly superior scale choice, it may be necessary to vary the scale of jet cross sections in the global 
fit in order to estimate its effect on the PDF errors. 



In Figs. 40 and 41 we plot the ratios of the NLO distributions calculated using different recom- 
bination schemes, where a^D is obtained with the energy scheme, and apj^ is with the Ex scheme. For 
single inclusive jet production at both the Tevatron and LHC, apj- is larger then a^D- An opposite trend 
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is observed in dijet production. Differences of the predictions based on the two schemes are larger with 
the Midpoint algorithm (used at the Tevatron) than with the anti-A;^- algorithm (used at the LHC). In an 
NLO calculation, the Midpoint algorithm allows a larger maximal angular separation {2R) between the 
two partons forming a jet, compared to the anti-A;^ algorithm that only allows the angular separation up 
to R. This produces the shown kinematical differences between the two schemes. 

CONCLUSIONS 

Jet production plays an important role at hadron colliders and is a main background process in the bulk 
of new physics searches. A benchmark comparison of NLO QCD predictions for jet production from 
different numerical codes can be useful for both the ongoing phenomenological studies and upcoming 
higher-order calculations. In this work we modify the original EKS program and compare the single- 
jet and dijet cross sections that it produces with the ones from the FastNLO program. We find a good 
agreement between two programs, apart from differences of up to 5-10% occuring at the highest jet p^'s 
and rapidities. We document the exact combination of theoretical settings in EKS that are needed to 
reproduce the FastNLO results. Based on the EKS calculation, we attempted to identify the choice of 
the renormalization and factorization scales that could simultaneously reduce the magnitude of NLO K 
factors and/or scale dependence of the NLO cross section. Since we could not easily find such a scale 
combination, we propose to vary the factorization and renormalization scales in future (N)NLO PDF fits 
to better estimate theoretical uncertainties in the resulting PDFs. There is a plan to publish the updated 
EKS program in the near future [i388il . 
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CMS LHC (7 TeV) measurement. ll293l 
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14. PHENOMENOLOGICAL STUDIES WITH AMC @ NLO E3 

Abstract 

We present four phenomenological studies of hadron collider processes per- 
formed within the aMC@NLO framework 

14.1 Introduction 

aMC@NLO (http://amcatnlo.cern.ch) is a fully automated approach to complete event generation and 
subsequent parton shower at the NLO accuracy in QCD, which allows accurate and flexible simulations 
for both signals and backgrounds at hadron colliders. All calculational aspects in aMC@NLO are 
automated. One-loop contributions are evaluated with MadLoop [8. 396| , that uses the OPP integrand 



reduction method 111211 as implemented in CutTools II135L The other matrix-element contributions to the 
cross sections, their phase-space subtractions according to the FKS formalism [397], their combinations 
with the one-loop results, and their integration are performed by MadFKS [398 |q] The matching of the 
NLO resuhs with HERWIG 13991 or PYTHIA [4001 parton showers is performed with the MC@NLO 
method II401II . and it is also completely automatic. Finally, aMC@NLO can compute scale and PDF 
uncertainties at no extra CPU-time cost with the help of the process-independent reweighting technique 
described in Ii402i] . 

For all technical details we refer to the original publications. We report here on the physics results 
obtained with aMC@NLO for observables of interest at hadron colUders |[T5ll4031l402l l404|. We stress 
that they are simulated at the hadron level, namely including parton shower and hadronization effects. 
In Sects. 14.2 14.3[ and 14.4 we present results for the production of ttH, Vbh, and four-lepton final 



states at the LHC, respectively. Section 14.5 reports on a study of the Wjj process at Tevatron. Finally, 
in sect. 14.6 we draw our conclusions. The list of the processes considered here should convince the 
reader that one can perform realistic analyses of experimental data, for signals and backgrounds, entirely 
within the aMC@NLO framework. 

14.2 The ttH process at the LHC 

The production process oi & H boson in association with a top pair [15] is a classic mechanism for 
Higgs production at the LHC ll24l|405L where the large ttH Yukawa coupling and the presence of top 
quarks can be exploited to extract the signal from its QCD multi-jet background. As an example of the 



use of aMC @NLO for this process we present, in Fig. 42 the Higgs transverse momentum distribution 
and the transverse momentum of the ttH or ttA system at the a/s= 7 TeV LHC for a Standard Model 
(scalar) Higgs with Mh = 120 GeV and for a pseudoscalar one with Ma = 120/40 GeV. The total NLO 
cross sections in the three cases are crNLo(-^/f = 120) = 103.4 fb, cjnloC-^a = 120) = 31.9 fb, and 
i^NLol-^A = 40) = 77.3 fb, respectively. At moderate values of the Higgs transverse momentum, the 
scalar and pseudoscalar cases are clearly distinguishable, while at larger values the three distributions 
tend to coincide. Parton shower effects give in general small corrections with respect to the a pure NLO 
calculation, except for variables involving all produced particles, such as the transverse momentum of 
the ttH or ttA system shown in the right panel of Fig. |42] 



14.3 The Vbh process at the LHC 

With Vbb we understand ivbb and £~^i~bb final states II403I . which are the main backgrounds to searches 
for SM Higgs production in association with vector bosons (WH/ZH), with the subsequent Higgs decay 
into a bb pair. The aMC @ NLO framework allows a realistic study including 

• NLO corrections; 

^* Contributed by: R. Frederix, S. Frixione, V. Hirschi, F. Maltoni, R. Pittau, R Torrielli 

^^The validation of MadLoop and MadFKS in the context of hadronic collisions has been presented in Ref. (8). 
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Fig. 42: Higgs transverse momentum distributions (left) and transverse momentum of the ttH or ttA 
system (right) in ttH /ttA events at the LHC (^=7 TeV), with aMC@NLO in the three cases: Scalar 
(blue) and pseudoscalar (magenta) Higgs with mfj/A = 120 GeV and pseudoscalar (green) with niA = 
40 GeV In the lower panels of the left part, the ratios of aMC@NLO over LO (dashed), NLO (sohd), 
and aMC@LO (crosses) are shown. Solid histograms in the right panel are relevant to aMC@NLO, 
dashed ones to a pure NLO calculation. 



• bottom quark mass effects; 

• spin-correlation and off-shell effects; 

• showering and hadronization. 



As an example we show, in Fig. 43 the invariant mass of the pair of the two leading b-jets, compared 
with the signal distributions for a standard Higgs with m// = 120 GeV. Fig. 43 is interesting because 
both signal and background are studied at the NLO accuracy. It should be noted that, since completely 
hadronized events are simulated, sophisticated studies of the jet sub-structure are possible within the 
aMC@NLO framework, as presented in Fig. |44j where the fractions of events containing zero 6-jets, 
exactly one 6-jet, and exactly two &-jets are plotted. The 6-jet fractions are fairly similar for Wbb and 
Zbb production, and the effects of the NLO corrections are consistent with the fully-inclusive K factors. 
On the other hand, the 66-jet contribution to the 6-jet rate is seen to be more than three times larger for 
i^vbb than for i^l^bb final states. This fact is related to the different mechanisms for the production 
of a bb pair in the two processes. At variance with the case of l^ubb production, in a i^£^bb final state 
the two 6's may come from the separate branchings of two initial-state gluons, and thus the probability 
of them ending in the same jet is much smaller than in the case of a. g ^ bb final-state branching, which 
gives the only possible contribution to a £^ vbb final state. 



14.4 Four-lepton production at the LHC 

Vector boson pair production is interesting in at least two respects. Firstly, it is an irreducible background 
to Higgs signals, in particular through the W~^W and ZZ channels which are relevant to searches for 
a standard model Higgs of mass larger than about 140 GeV. Secondly, di-boson cross sections are quite 
sensitive to violations of the gauge structure of the Standard Model, and hence are good probes of sce- 
narios where new physics is heavy and not directly accessible at the LHC, yet the couplings in the vector 
boson sector are affected. We consider here the neutral process II402I 
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Fig. 43: Invariant mass of the pair of the two leading 6-jets. WH{-^ ivhb), ZH{ 
i^l^bb results are shown, with the former two rescaled by a factor of ten. 
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Process 


Cross section (fb) 


qq/qg channels 


gg channel 


0{a'J O(a0) + O(aJ 


0(«^) 


pp — >• e^e^e^e^ 


Q 1 Q 19 nn+0-27{2.1%)+0.26(2.0%) 
s-.i^ ±z.»u_g23(1.8%)-0.22(1.7%) 
4 ^8 fi .o+0.13(2.1%)+0.11(1.7%) 
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^ c^„+0.162(28.6%)+0.012(2.1%) 
^•^""-0.118(20.8%)-0.014(2.5%) 



Table 10: Total cross sections for e~^e~ fi'^fi~ and e~^e~e'^e~ production at the LHC (VS = 7 TeV) 
within the cuts M{i^£^'^^) > 30 GeV. The first and second errors affecting the results are the scale and 
PDF uncertainties (also given as fractions of the central values). 



which, although smaller than the W~^W~ channel, may provide a cleaner signal due to the possibility of 
fully reconstructing the decay products of the two vector bosons. aMC @NLO predictions for the cross 
sections are given in Tab.[TO{ which also includes aMC@NLO estimates for scale and PDF uncertainties. 



The four-lepton invariant mass and the transverse momentum distribution are presented in Fig. 45 where 



comparisons between the results obtained with aMC @NLO matched to HERWIG and to PYTHIA are 
also given. We stress that these results include the contributions due to (^^r-initiated processes, which 
have also been computed automatically. These are formally of NNLO, but may play a non-negligible 
phenomenological role owing to their parton-luminosity dominance at a large-energy collider such as the 
LHC. 



14.5 Wjj at Tevatron 

In II406II CDF reported an excess of events in two-jet production in association with a W boson, in the 
form of a broad peak centered at Mjj = 144 GeV in the dijet invariant massPl Motivated by this fact, we 
present in Fig.|46]the aMC@NLO prediction 14041 for the dijet invariant mass in Wjj events, using the 
same cuts as CDF and DO in the signal region, also comparing with a pure NLO computation and with the 
Alpgen 04O8II findings (one-, two-, and three-parton multiplicities have been consistently matched to 

^'^Such an excess has so far failed to be confirmed by a very similar DO analysis |407 1. 
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Fig. 44: Fractions of events (in percent) that contain: zero 6-jets, exactly one 6-jet, and exactly two 6-jets. 
The rightmost bin displays the fraction of 6-jets which are &6-jets. The two insets show the ratio of the 
aMC@NLO results over the corresponding NLO (solid), aMC@LO (dashed), and LO (symbols) ones, 
separately for Wbb (upper inset) and Zbb (lower inset) production. 



obtain the latter). Perturbative, parton-level results agree well with those obtained after shower, and PDF 



and scale uncertainties (also reported in Fig. 46 1 are well under control. In summary, we do not observe 
any significant effects in the shape of distributions due to NLO corrections, which therefore cannot be 
responsible for the excess of events observed by the CDF collaboration. 



14.6 Conclusions 

The results we have presented in this contribution are based on the strategic assumption that, for the word 
automation to have its proper meaning, the only operation required from a user is that of typing-in the 
process to be computed, and other analysis-related information (such as final-state cuts). In particular, the 
codes that achieve the automation may only differentiate between processes depending on their general 
characteristics, but must never work on a case-by-case basis. The aMC@NLO framework is based 
on such an assumption, providing a very powerful tool to compare, at the NLO accuracy including 
showering and hadronization, theory and experiment in high energy collisions. As an example of the 
flexibility of aMC@NLO we have presented results for the processes pp — )• ttH, pp — )• Vbb, pp — )• 
£+£-£(')+£(')- at the LHC, and a study of pp -^ Wjj at Tevatron. 
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15. PROBING CORRECTIONS TO DUET PRODUCTION AT THE LHC 

Abstract 

We compare and discuss a few kinematic distributions for dijet production 
at the LHC, computed with a fixed next-to-leading order code, with the 



Contributed by: S. Alioli, J. R. Andersen, C. Oleari, E. Re, J. M. Smillie 
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Fig. 45: Four-lepton invariant mass and the transverse momentum distributions for aMC@NLO 
+gg HERWIG (solid black) and PYTHIA (dashed blue) results. The rescaled gg contribu- 
tions with HERWIG (open black boxes) and PYTHIA (open blue circles) are shown separately. 
Middle insets: scale (dashed red) and PDF (solid black) fractional uncertainties. Lower insets: 
aMC@NL0/(aMC@NL0+c/5) with HERWIG (solid black) and PYTHIA (dashed blue). 



POWHEG BOX and with HEJ. Previous experimental studies have dealt with 
kinematic distributions where the predictions of the three approaches were 
very similar. In this proceeding, we investigate kinematic distributions where 
the resummed effects in POWHEG and HEJ are clearly shown and enhanced 
with respect to the fixed NLO result, since different QCD-radiation regimes 
are probed. 

15.1 Introduction 

Dijet production is one of the cornerstone processes at the LHC. The cross section for jet production is 
very large, making it an important testing ground for our understanding of QCD at high-energy scales. 
In addition, jet production is an important background for many searches for new physics. It is therefore 
essential to probe and test our theoretical predictions. Dijet-production studies can bring insights in jet 
production in association with other particles too: for example, Higgs boson production plus two jets in 
gluon fusion, a key process for assessing the CP properties of the Higgs boson, can benefit from these 
studies. 

There have been a number of very interesting experimental studies in dijet production by both the 
ATLAS L4101 Sm [293 and CMS MM SEl gH ESJ Collaborations. It is already clear that higher 
order QCD contributions beyond a fixed order, low multiplicity calculation can be important because the 
large available phase space for jet emission at the LHC compensates for the suppression of extra powers 
of the strong coupling constant. 

In this contribution, we compare two theoretical approaches to dijet production that include higher 
order effects: POWHEG gTH gTIl |386l [Bj and HEJ MM ES US. The POWHEG method suc- 
cessfully merges a fixed next-to-leading order (NLO) calculation with a parton shower program, that 
resums leading logarithmic contributions from coUinear emissions. In this study, the POWHEG results 
obtained with the POWHEG BOX [Td] are interfaced with the transverse-momentum-ordered shower 
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Fig. 46: Invariant mass of the pair of the two hardest jets, with CDF/DO cuts of II406II (left) and of II409I 
(right). 



provided by PYTHIA 6.4.21 II400L In contrast, the starting point for HEJ is an all-order approximation 
to the hard scattering matrix element in the regime of wide-angle QCD emissions. HEJ is accurate at 
leading logarithmic precision in the invariant mass of any two jets. This is then supplemented with the 
missing contributions (through a merging and reweighting-procedure) necessary to also ensure tree-level 
accuracy for final states with up to four jets. The tree-level matrix elements are taken from Standalone 
Madgraph |[T63l . 

The POWHEG and HEJ approaches are clearly very different in their description of QCD radia- 
tion. Nevertheless, for several kinematic distributions (see for example ref. [410|) the predictions from 
POWHEG and HEJ are very similar. In this study, we investigate various observables which can expose 
the differences in the two approaches and we compare them with the fixed NLO results. 



15.2 A comparison between NLO, POWHEG and HEJ in dijet production 

In order to avoid biasing our event sample, we impose a minimal set of cuts, avoiding symmetric cuts on 
the jet transverse momenta that would give an unphysical cross section at fixed NLO level II420I 142111 . 
due to the presence of unresummed logarithms. Neither the POWHEG or HEJ descriptions suffer from 
this instability. However, in order to have a sensible fixed NLO cross section to compare with, we impose 
asymmetric cuts 

p{ > 35 GeV, p{' > 45 GeV, \yj\ < 4.7 , (99) 

i.e. all jets are required to have a minimum transverse momentum of 35 GeV, and the hardest-jet trans- 
verse momentum, j^-^ , is required to be greater than 45 GeV. In order to comply with the experimental 
acceptance, all jets are further required to have an absolute rapidity \yj\ less than 4.7. Jets are defined 
according to the anti-kt jet algorithm, with radius R = 0.5. Only events with at least two jets fulfilling 



Eq. ( 99 1 are kept. 

In the following, we compare the fixed NLO cross section with the POWHEG first emission re- 
sults, with the POWHEG results showered by PYTHIA and with the HEJ predictions. The renormal- 
ization and factorization scales have been chosen equal to the transverse momentum of the hardest jet 
in each event, for the HEJ predictions. For the NLO computation (and for computing the POWHEG 
B function), scales are set to the transverse momentum of the so called underlying Born configuration. 
Scale-uncertainty bands obtained by varying these scales by a factor of two in each direction are shown 
for the NLO and HEJ results. The scales entering in the evaluation of parton distribution functions and 
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Fig. 47: The average number of jets as a function of Ay^h (left plot) and of i/^ (right plot), as predicted 
by a fixed NLO calculation, by POWHEG first emission, by POWHEG+PYTHIA and by HEJ. The 
dotted red lines around the HEJ prediction and the green ones around the NLO result are obtained by 
varying the renormalization and factorization scales by a factor of two around their central value. 



of the strong coupling in the POWHEG Sudakov form factor are instead evaluated with a scale equal to 
the transverse momentum of the POWHEG hardest emission II417II386TI . 



In Fig. 47 we plot the average number of jets as a function of the rapidity difference between 
the most forward and most backward of the jets fulfilling Eq. ( [99) ), Ai/fb, on the left-hand side, and 
as a function of H^ = J2jPt '^^ "^^e right-hand side. The wide-angle resummation implemented in 
HEJ produces more hard jets than POWHEG and the fixed NLO calculation, as the rapidity separation 
between the most forward and the most backward jet in the event increases. Both the NLO and the first- 
emission POWHEG results have at most 3 jets, so that the average number of jets cannot exceed 3, and 
give similar results. Additional jets are instead produced by the PYTHIA shower, so that the average 
number of jets is increased by roughly 20% with respect to the NLO one, for Z\yfb ~ 7. For the same 
separation in rapidity, the HEJ prediction is 45% larger than the NLO result, with a chance to distinguish 
among the three approaches. 

The dependence of the average number of jets from H-^ (right plot) displays a different behaviour: 
here the showered events have on average more jets than HEJ and the NLO results, as the sum of the 
transverse momentum of all the final-state jets increases. It is interesting here to comment on the NLO 
result obtained with the factorization and renormalization scales set to p™/2, i.e. half of the transverse 
momentum of the underlying Born configuration. In fact, from the plot, an unphysical behaviour of this 
quantity emerges: the average number of jets is greater than 3 above H-^ ~ 270 GeV. This is due to the 
fact that the high if^ region is populated mostly by events with 3 jets, two of which have approximately 
the same high transverse momentum, and the third one is softer with respect to the other two (the cuts in 
Eq. ( [99] ) are always in place). In this configuration, the exclusive two-jet cross section becomes negative, 
due to incomplete cancellation of the virtual (negative) contribution, now enhanced by a higher value of 
the strong coupling constant, evaluated at a lower renormalization scale. A more detailed discussion can 
be found in ref. 114221 . 



As a last example of a kinematic distribution that displays different behaviour if evaluated at NLO 
or using POWHEG or HEJ, we plot in Fig. 48 the average value of cos(7r — (/)fb), where (p^h is the 
azimuthal angle between the most forward and backward jets, as a function of their rapidity separation 
Z\yfb. For dijet events at tree-level, </)fb = vr since the two jets must be back- to-back, and the average 
value of the cosine is 1. Deviation from 1 then indicates the presence of additional emissions, so that 
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Fig. 48: The average value of cos(7r — (?!)fb) as a function of Z\yfb, where 0fb is the azimuthal angle 
separation between the most forward and most backward jet. The dotted red and green lines are obtained 
by varying the renormalization and factorization scales by a factor of 2 in both directions. 



this kinematic distribution carries information on the decorrelation between the two jets. This quantity 
is more inclusive than the average number of jets as it is sensitive also to emissions below the jet p-^ cut. 
The higher radiation activity in POWHEG+PYTHIA and in HEJ, with respect to the fixed NLO and the 
POWHEG first-emission results, is clearly visible in the figure: the stronger jet activity produced by HEJ 



at higher rapidity separation (see the left plot of Fig. 47 1 lowers the average value of the cosine below the 
POWHEG+PYTHIA result. As expected, the average value predicted by the POWHEG first-emission 
and the NLO calculation is closer to 1 , since they contain at most one radiated parton. 



Conclusions 

In this proceeding, we have discussed the results obtained using a fixed NLO calculation, HEJ and 
POWHEG-i-PYTHIA, in the description of three kinematic distributions, selected in order to display 
more clearly the differences among the three approaches: the average number of jets and azimuthal 
decorrelation between the most forward and the most backward jet, plotted as a function of the rapidity 
separation of the most forward and the most backward jet, and the average number of jets plotted as a 
function of the sum of the transverse momenta of all the jets in the event. 

While the Umitations of the NLO calculation are clearly visible when we probe regions of the 
phase space where multi-jet emissions becomes important, the predictions of POWHEG-i-PYTHIA and 
HEJ are distinguishable when dealing with the average number of jets as a function rapidity span. Less 
marked differences are found as a function of Ht, and in the study of the azimuthal decorrelation of the 
most forward and backward jet. 

An experimental analysis of the dijet data, collected at the LHC, should then follow to investigate 
to which extent our theoretical knowledge for these kinematic distributions is under control. 
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16. W+JETS PRODUCTION AT THE LHC: A COMPARISON OF PERTURBATIVE TOOLS 

Abstract 

In this contribution, we discuss several theoretical predictions for W plus jets 
production at the LHC, compare the predictions to recent data from the ATLAS 
collaboration, and examine possible improvements to the theoretical frame- 
work. 

16.1 Motivation 

Experimentalists are reliant on a number of tools, at LO and NLO, at parton level and at hadron level, in 
order to understand both simple and complex final states at the LHC. One of the benchmark processes, 
for both signals to new physics and for their backgrounds, is the production of W plus jets. In this 
contribution, we discuss several different predictions for the W plus jets final state, concentrating on the 
Ht distribution. We examine where the predictions agree, and where they disagree and compare the 
predictions to LHC data. We introduce the idea of NLO 'Exclusive Sums', and discuss the performance 
of this technique and consider also how LoopSim may be able to improve the predictions. We document 
the use of ROOT ntuples for W plus jets predictions produced by the BlackHat-i-Sherpa collaboration, 
indicating how they can be used to examine the variation of the cross sections with jet size/algorithm, 
PDFs, and scale choices. We also study the possibility of using the LoopSim method together with 
BlackHat-i-Sherpa type ntuples, since this may offer the opportunity to improve on the results from NLO 
Exclusive Sums. 

16.2 Theory tools: strengtiis and weaknesses 

NLO is the first order at which the normalization (and sometimes the shape) of LHC cross sections can be 
realistically calculated. The state of the art is in parton-level programs such as BlackHatH-Sherpa, where 
W + n-jet cross sections are available, with n up to 4 at NLO fTOl ISTl |22]| (and soon up to 5 114231 ). 
Of course, such parton-level final states do not allow for the full comparisons to the data allowed by the 
full parton shower Monte Carlo programs such as Sherpa. NLO matrix elements have been included into 
parton shower Monte Carlos, but only for relatively simple final states (although we note that the NLO 
matrix elements for H^ + 2 jets II404II and W + 3 jets II424II have recently been implemented in parton 
shower Monte Carlo programs). 



The Sherpa Monte Carlo program 111461 I425II includes the exact LO W + n-parton (W + n- 
jet) matrix elements, with n up to 4 (in this study), using the newer ME&TS scheme as introduced 
in Refs. [|426[ 14271 1428| for the addition of states with different jet multiplicities with the correct 
normalizations. The newer matrix-element plus parton-shower merging scheme improves over the 
CKKW 114291 I430II formalism by allowing for a better interplay between the matrix-element and paiton- 
shower descriptions. This in particular required the implementation of truncated showers ('TS'). As 
before, additional jets are, of course, then produced by the parton shower. Both BlackHatH-Sherpa 
and Sherpa rely on DGLAP-based evolution of gluon emission, on the assumption that the gluon emis- 
sions are strongly ordered in transverse momentum. For an alternative prediction, we use the program 
HEJ ||418[|419ll4311 . The High Energy Jets (HEJ) framework provides a leading-log resummation of the 
dominant terms in the limit of large invariant mass between jets. In addition, HEJ contains a merging 
procedure to ensure tree-level accuracy for final states with two, three or four jets. 



^"Contributed by: J. R. 'Andersen, J. Huston, D. Maitre, S. Sapeta, G. P. Salam, J. M. Smillie, J. Winter 
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A NLO n-jet prediction produces events with with either n or n + 1 partons. For observables 
for which higher multiplicities have a significant impact, this limitation can be detrimental. If one has 
predictions for different multiplicities, one can try to combine them by avoiding double counting by re- 
quiring that the n-jet prediction is used only to describe n-jet events (except for the highest multiplicity 
where (n + l)-jets configurations are allowed). This procedure is crude and does not increase the for- 
mal accuracy of the prediction which is that of NLO of the smallest multiplicity. The idea is that, in 
observables where higher multiplicities events dominate, a better prediction might be obtained. This has 
been denoted as the 'Exclusive Sums' technique. The impact of the Exclusive Sums approach depends 
on the kinematic variable under consideration. For this contribution, we consider only the Ht variable, 
defined as the sum of the transverse momenta of all of the leptons (including neutrinos) and jets in the 
event. The impact of the approach is expected to depend on the observable under consideration and it 
may be more beneficial for variables sensitive to multi-jet radiation, such as Ht, than for more inclusive 
variables such as pt,w- Comparisons for the latter are left to a study now in progress. 

16.3 Use of BlackHat+Sherpa ntuples 

As has been partially detailed in these proceedings, there have been many advances in the computation 
of the NLO corrections for multi-parton final states. Often such calculations do not exist in a compact 
user-friendly form, and other means must be taken to allow experimentaUsts to have access to the results. 
The BlackHat-i-Sherpa collaboration has chosen to make available ROOT tuples that contain all of the 
parton-level information needed to form flexible predictions. The ROOT ntuple framework is a very 
efficient way to store such information and the use of ROOT tuples is very familiar to experimentalists. 

The ROOT ntuples store the four- vectors for the final state partons, as well as their flavor informa- 
tion. The calculation is originally performed using a specific choice of PDF, as{mz), renormalization 
scale fiji and factorization scale fip, but weight information is also stored in the ntuples that allows each 
event to be easily re-weighted to any other (reasonable) values for the above parameters. (PDFs are 
varied through calls to LHAPDF II432II .) No jet clustering has been performed on the final state partons; 
jet reconstruction is left to the user, for any jet algorithm/size for which the correct counter-events are 
present in the ntuple. For the results presented here, the SISCone 1143 31 . kx II361I and anti-Zc^ [341] 
algorithms, with jet radii R of 0.4, 0.5, 0.6 and 0.7 can be used. Each of the above jet algorithms were 
run and the results stored in SpartyJet ntuples^] The SpartyJet tuples were 'friended' with the Black- 
Hat-i-Sherpa ntuples, allowing the analysis script access to all jet information. Such a flexibility allows 
for an investigation of the dependence of the physics on the details of the manner in which the partons 
are combined into jets, in a manner difficult to achieve prior to this. 



The four-vector information stored in the BlackHat-i-Sherpa ntuples is shown in Table 1 1 Note 
the variety of entries needed for the re-weighting of the cross section results, especially for the case of 
the variation of the two scales /x/j and fip. Information is stored in separate ntuples for the different 
categories of events, which are typically Born, loop (leading color and sub-leading color), real and sub- 
traction terms. For large n,mW + n-parton final states, there are many divergences present when two 
partons become collinear or one parton becomes soft. These divergences are controlled using the tradi- 
tional Catani-Seymour approach II236I . which involves the generation of many counter-events. Many of 
the events have negative weights; only the sum is guaranteed to be positive-definite. Predictions with rea- 
sonable statistical precision may require the sum of billions of events. The resultant tuples may amount 
to several Terabytes. However, the output can be subdivided into ROOT files of order 5-10 GB, allowing 
for simultaneous parallel processing of the events over multiple nodes, such as in the Tier3 facility at 
Michigan State University used for these comparisons. 



^' SpartyJet 14341 is a set of software tools for jet finding and analysis, built around the FastJet library of jet algorithms 14351 . 
SpartyJet provides four key extensions to FastJet: a simple Python interface to most FastJet features, a powerful framework for 
building up modular analyses, extensive input file handling capabilities, and a graphical browser for viewing analysis output 
and creating new on-the-fly analyses. 
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16.4 BlackHat+Sherpa predictions 

We have generated NLO predictions with the BlackHat+Sherpa predictions implementing the cuts used 
in the 2010 ATLAS W plus jets paper Ii436il . For completeness, the cuts are reproduced below: 

> 20GeV, 



lepton 

Pt 

|^lepton| ^ 2.4, 



jTrniss 

Jl/rp 



> 25GeV, 
mT,w > 40GeV, 
pj^* > 30GeV, 



I y-'°* I 



< 4.4, 

ton— jot 



> 0.5. 



In Figure 49 we show the NLO BlackHat+Sherpa prediction for the Ht distribution for W+ > 1 
jets (left) using the anti-A;^- jet algorithm with R = 0.4. As the prediction is an inclusive NLO calculation 
for W+ > 1 jets, there are contributions from both the one-jet and the two-jet final states. Note that as 
Ht increases, the contributions from the W + 2-jet subprocess also increases. On the right, we again 
show the Ht distribution, but now compute the prediction using the 'Exclusive Sums' technique, adding 
in the NLO W + 2-jet information. Now there is a significant contribution at high Ht from the W^ + 3- 



branch name 



type 



notes 



id 



nparticle 
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px 


F[nparticle] 


py 


F[nparticle] 


pz 


F[nparticle] 
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F[nparticle] 


alphas 
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kf 
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me_wgt 
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me_wgt2 
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xl 


D 



x2 



D 



xlp 
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x2p 
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idl 
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id2 
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fac_scale 




D 


ren_scale 




D 


nuwgt 




I 


usr.wgts 


D[ 


nuwgt] 



id of the event. Real events and their associated counter-terms share 
the same id. This allows for the correct treatment of statistical errors, 
number of particles in the final state 
array of the x components of the final state particles 
array of the y components of the final state particles 
array of the z components of the final state particles 
array of the energy components of the final state particles 
as value used for this event 
PDG codes of the final state particles 
weight of the event 

weight of the event to be used to treat the statistical errors correctly 
in the real part 

matrix element weight, the same as weight but without pdf factors 
matrix element weight, the same as weight2 but without pdf factors 
fraction of the hadron momentum carried by the first incoming par- 
ton 

fraction of the hadron momentum carried by the second incoming 
parton 

second momentum fraction used in the integrated real part 
second momentum fraction used in the integrated real part 
PDG code of the first incoming parton 
PDG code of the second incoming parton 
factorization scale used 
renormalization scale used 
number of additional weights 
additional weights needed to change the scale 



Table 11: Branches in a BlackHat+Sherpa ROOT file. 
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Fig. 49: The W plus jets cross section, as a function of Ht, for the NLO inclusive W+ > 1-jet prediction 
(left) and for the Exclusive Sums approach, adding in W^ + 2-jet production at NLO (right). The cross 
sections have been evaluated at a central scale of Ht/2 and the uncertainty is given by varying the 
renormalization and factorization scales independently up and down by a factor of 2, while ensuring that 
the ratio of the two scales is never larger than a factor of 2. 



jet final state as well. In Figure 50 the Ht prediction is shown using the Exclusive Sums approach, 
adding 1+2+3 jets at NLO (left) and 1+2+3+4 jets at NLO (right). It is evident that as Ht increases, 
contributions from higher jet multiplicities that are only present implicitly in a traditional inclusive NLO 
W+ > 1-jet calculation, become important. The Exclusive Sums Ht predictions agree with that for the 
inclusive NLO W+ > 1-jet calculation at low Ht, but are larger at higher Ht, and in better agreement 
with the ATLAS data (as discussed below). 

However, it can also be noticed that the scale dependences for the Exclusive Sums predictions 
apparently get better when the 2-jet NLO information is added, but significantly worse when the 3-jet 
and 4-jet information is added. As discussed in the Appendix, the reduction in scale dependence with 
the addition of the 2-jet NLO terms may be due to the stabilization of the predictions for the qq — ;■ Wq'q 
topologies. Adding the 3-jet and 4-jet NLO terms seems to destabilize the predictions. There are missing 
Sudakov terms needed to properly 'stitch' the different multiplicity samples together; it is hoped that the 
LoopSim technique may offer one way in supplying those missing terms. 



Below in Figure 51 we show the NLO BlackHat+Sherpa predictions for the Ht distribution for 
W+ > 2 jets: the inclusive calculation to the left, the Exclusive Sums result adding 2+3-jet NLO 
information in the middle and the Exclusive Sums result adding 2+3+4-jet NLO information to the right. 
Over the kinematic range covered in these plots, the Exclusive Sums technique adds less to the cross 
section at high Ht, although there is still a degradation of the scale dependence. 
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Fig. 50: The W plus jets cross section, as a function of Ht, for W+ > 1-jet production using the 
Exclusive Sums approach, and adding up to 3 jets at NLO (left) and 4 jets at NLO (right). The cross 
sections have been evaluated at a central scale of Ht/2 and the uncertainty is given by varying the 
renormalization and factorization scales independently up and down by a factor of 2, while ensuring that 
the ratio of the two scales is never larger than a factor of 2. 
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Fig. 51: The W plus jets cross section, as a function of Ht, for W+ > 2-jet production using the 
inclusive NLO production (left) and the Exclusive Sums approach, adding up to 3 jets at NLO (center) 
and 4 jets at NLO (right). The cross sections have been evaluated at a central scale of Ht/2 and the 
uncertainty is given by varying the renormalization and factorization scales independently up and down 
by a factor of 2, while ensuring that the ratio of the two scales is never larger than a factor of 2. 
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16.5 Towards interfacing BlackHat+Sherpa ntuples with LoopSim 

LoopSim is a method to simulate higher order QCD corrections, in particular those beyond NLO. It 
is expected to work best for processes with large NLO-to-LO i^T-factor, however it was found to be 
advantageous even in some cases where the A'-factor is moderate ||437i The method is based on unitarity 
and its main ingredient is a procedure that takes events from a process with n + m partons in the final 
state and produces counter- term events with n + m — l,n + m — 2,...,n particles, which approximate 
1-loop, 2-loop, etc. contributions. In contrast to the Exclusive Sums method, it enables one to introduce 
(approximate) virtual corrections beyond 1-loop, thus ensuring that the a^L^ type terms cancel for all 
the orders that are included. While we will not show LoopSim results that are directly comparable to the 
ATLAS data (the samples were generated before those cuts were made public), we will examine below 
the dependence on the pt,nim choice (which sets the size of L = lnOt/pt,mm where Ot is a transverse 
observable) and see that it vanishes as pt,mm — ^ 0. 

To distinguish between the exact result at the order N^LO and the result with simulated loops we 
use a notation in which we replace N by n for the orders simulated by LoopSim. So for example, VF + 1 
jet at nLO has approximate 1-loop diagrams and is obtained by combining 1^ + 1 jet at LO with W + 2 
jet at LO where the latter is passed through LoopSim. Similarly M^ + 1 jet at nNLO has exact 1-loop 
diagrams but simulated 2-loop contributions (by using VF + 2 jet at NLO as an input to LoopSim). 

As argued in the previous section, the BlackHat-i-Sherpa ntuples allow one to efficiently perform 
a broad range of analyses. They have however a limitation. In order to reduce the size of stored files, 
the only partonic events that are recorded for the W + n-jet sample are those in which there are at least 
n jets above a 20 GeV threshold. Since this threshold is below the jet cuts used by ATLAS and CMS, 
it is adequate for any NLO study of LHC jet cross sections. The situation is slightly more complex if 
we want to use the BlackHat-i-Sherpa ntuples to compute predictions beyond NLO using LoopSim. This 
is because the cut that is present in the W + 2-jet BlackHat-i-Sherpa NLO sample eliminates part of the 
real contribution to the W + 1-jet phase space at NNLO, for example W + 3-parton events in which the 
3 partons all form part of a single jet, or in which 2 partons form part of one jet, while the third is well 
separated in angle but below the 20 GeV jet threshold. 

Since we plan to use LoopSim interfaced to BlackHat-i-Sherpa ntuples in our future study of multi- 
jet processes, it is important to directly check the effect of the finite generation pt cut, pf^^^, on the 
predictions of the pt and Ht distributions. We have performed such a study for W^ + 1 jet generated 
with MCFM, where we varied a 'parton'-pt generation cut from 1 to 20 GeVJ^ This is not entirely 
equivalent to the cut in the BlackHat-i-Sherpa samples (which is applied to the standard jets, not to the 
partons), but should be adequate from the point of view of estimating the potential order of magnitude of 
finite generation cuts. The output from MCFM was interfaced to LoopSim which produced the additional 
loop diagrams. Then, the events were analyzed with the following set of cuts: |yi^pt°'i| < 2.5, p^"^ ""^ > 
20 GeV , lyj''*! < 4.5, p!^^ > 25 GeV , 'mT,w < 20 GeV, where the anti-A;T algorithm with R = 0.4 was 
used for clustering. 



The results are presented in Figure 52 where the ratios of cross sections obtained with a range of 
generation cuts are shown as functions of the pt of the leading jet and ffy jets- At NLO, the only artefact 
we see is for the pf^l^n of 20 GeV in the bin below 40 GeV. This is as expected, since a 20 GeV cut 
on each of two partons can at most affect jets up to 40 GeV (such an artefact would not be present in 
the BlackHat-i-Sherpa samples). At nLO and nNLO, however, the dependence on p^gen is extended to 
a larger range of Pf,icad.jct/^T,jcts and it is visible also for values of p™™ < 20 GeV. However, even 
if the p™gcn dependence of the nLO and nNLO results is stronger than at NLO, it dies out quickly with 
increasing Pi,iead.jct/^T,jets and becomes irrelevant at ~ 100 GeV, depending on the observable and the 



'^Strictly speaking, the events destined for the W + n-jet sample were clustered with the kr algorithm with a very small 
separation, R — 0.03, and accepted when at least n small-_R jets were above the generation cut. These events were then passed 
through the standard analysis cuts (with or without LoopSim). 
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Fig. 52: Ratios of cross sections from runs with a certain range of pf^^^ values taken wrt. the cross 
section generated with p™gen = 1 GeV for the distributions of pt of the leading jet (left) and the scalar 
sum of jets' transverse momenta, -f^rjcts (right). 



order. This appears to be consistent with the expectation that the effect of the cut should vanish as a 

power of p^^JJ^/pi^iead.jet Or pf^^^J HT,jets- 

Therefore we conclude, that in spite of the finite generation cut one should be able to trust the 
results obtained using BlackHat-i-Sherpa ntuples, above a moderate pt limit, even for more complex 
analyses such as those involving LoopSim. 



16.6 Comparisons to data, Sherpa and HEJ predictions 



In Figure 53 (left), we compare the ratio of the 2010 ATLAS W plus jets data for the Ht distribution 
for W+ > 1 jets to predictions using the generic NLO calculation for W+ > 1 jet, the Exclusive 
Sums approach adding up to 4 jets at NLO and the Monte Carlo event generator Sherpa. The agreement 
between the data and the pure NLO result is rather poor; it improves substantially with the inclusion 
of the Exclusive sums up to two jets at NLO, with further small improvements coming from higher 
multiplicities. As a reminder, we previously noted that the scale dependence improved when adding 
the 2-jet NLO information, but degraded when adding higher jet multiplicities. The Sherpa prediction 
slightly overshoots the data for Ht in the inclusive W + 1-jet bin. We however note that the data versus 
Sherpa Ht ratio has been formed based on the absolute normalization as given by the Monte Carlo 
simulation. Comparing the inclusive 1-jet cross sections, we find a factor of 0.97 between the data and 
the Sherpa result. 



In Figure 53 (right), we compare the ratio of the 2010 ATLAS W plus jets data for the Ht distri- 
bution for W+ > 2 jets to predictions using the generic NLO calculation for VF+ > 2 jet, the Exclusive 
Sums approach adding up to 4 jets at NLO, and to predictions from HEJ and from Sherpa. As noted 
previously, there is some increase in the predictions from the Exclusive Sums approach at the highest 
Ht values, but not nearly as much as in the 1^+ > 1-jet case. These increases go in the direction of 
closer agreement with the data, but the statistical error does not allow a clear judgement to be made. The 
Sherpa and HEJ predictions for this ratio are in reasonable agreement with the data but appear to fall 
off somewhat more rapidly at large Ht than either the data or the various BlackHat-i-Sherpa predictions. 
Again this partly is the result of relying on the absolutely normalized Monte Carlo predictions, which 
yield W+ > 2-jet normalization factors of 0.95 or 0.93 between data and Sherpa or HEJ, respectively. 
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Fig. 53: The ratios of the ATLAS W+ > 1-jet (left) and W+ > 2-jet (right) cross sections, as a 
function of Ht, taken wrt. various theory predictions. The absolute normalization has been kept as given 
by the calculations. The error bars represent the total fractional error (statistical plus systematic added in 
quadrature) at each point. 



1.^ 




; 


Data fraction >=2 
Data fraction >=3 






1 


- 


■ 


Data fraction >=4 
Blaclfhat+Sherpa fractic 
Blaclfhat+Sherpa fractic 
Biacl(hat+Sherpa fractic 
Sherpa fraction >=2 
Sherpa fraction >=3 
Sherpa fraction >=4 


n>=2 

n>=3 
n>=4 




1 






0.8 


- 




T \ 










c 

fo.6 

S 


- 




i i 


* 








0.4 


- 






t 


















» 






' I 1 








1 






T 




0.2 


- 






f 1 












if 


1 1 




n 


]_ 


ul 


i.i jl * i T 1 1 


1 


1 1 1 1 1 1 1 1 1 1 1 1 1 


, 1 , , 



100 200 300 400 500 600 700 
H,(GeV) 



0.8 



0.4 



0.2 



A Data traction >=3/>=2 

A B laclth at + Sherpa exclusive fraction >=3/>=2 

o Sherpa traction >=3/>=2 

n HEJ fraction >=3/>=2 



100 200 300 400 500 600 700 
H,{GeV) 



Fig. 54: (left) The fraction of the Ht cross section for W+ > 1-jet events arising from the W+ > 2-jet, 
W+ > 3-jet and W+ > 4-jet final states derived from the Exclusive Sums approach, from Sherpa and 
from HEJ, compared to the 2010 ATLAS data, (right) The ratio of the cross sections for W+ > 3 jets to 
W+ > 2 jets, as a function of Ht, using predictions from the Exclusive Sums approach, from Sherpa 
and from HEJ, compared to the ratio from the 2010 ATLAS data. 



In Figure[54](left), we show the predictions for the fractions of the Ht cross section in the inclusive 
W + 1-jet bin arising from the inclusive W + 2-jet, W + 3-jet and W + 4-jet final states as obtained 
from the Exclusive Sums approach and from Sherpa, compared to the 2010 ATLAS data. In Figure [54 
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(right), we show the ratio of the cross sections for W+ > 3 jets to W+ > 2 jets, as a function of 
Ht, again using predictions from the Exclusive Sums approach and from Sherpa but also from HEJ. We 
again compare to the ratio given by the 2010 ATLAS data. All three predictions agree with each other 
and with the data over the range considered, despite the big differences in the approaches. There may be 
an indication of some separation between the predictions at the very highest Ht values. 

16.7 Conclusions, outlook and future studies 

The advances achieved over the last few years in calculating NLO corrections for multi-jet final states 
allow a more serious consideration of the possibility to combine various n-jet NLO predictions into an 
inclusive jet sample. The Exclusive Sums approach discussed in this contribution is a first promising step 
into this direction. More studies are required to understand the uncertainties related to this procedure. 
One way of doing so would be to test the stability of the predictions against variation of the jet algorithm 
and/or parameters of the jet algorithm used to obtain and separate the different NLO predictions for the 
fixed-multiplicity sets that eventually make up the sum of exclusive n-jet contributions p^ 

For the Exclusive Sums approach, outlined here for the case of W+ > 1 jets, contributions are 
added proportional to a'^ (W + I jet at NLO), a^ (W + 2 jets at NLO), aj{W + 3 jets at NLO) and 
al (W + 4 jets at NLO), i.e. this procedure mixes powers of Os and thus is missing essential Sudakov 
form factors that effectively bring each term to the same power of ag. One could imagine accomplishing 
this by embedding the NLO matrix elements in a parton shower Monte Carlo framework, however the 
technology for merging different multiplicities of NLO calculations with a parton shower is still under 
development. Note that at LO the tree-level matrix-element plus parton-shower merging methods (e.g. 
as implemented in Sherpa) are designed to satisfy this same-C'(as) requirement by including the (all- 
orders) leading-log effects to the 'LO Exclusive Sums' exhibiting the LO analog of the Exclusive Sums 
discussed here. Compared to the matrix-element plus parton-shower merging, we see that the 'NLO 
Exclusive Sums' technique only accounts for Sudakov effects up to 0{as) while it describes each jet bin 
at full NLO instead of LO accuracy. 

Relying on the parton shower Monte Carlo framework is not the only way to go in refining the 
Exclusive Sums strategy. Alternatively, the LoopSim method can be used to provide approximations to 
the higher-loop terms missing in the Exclusive Sums approach. As we have seen here, prospects for 
using it together with BlackHat-i-Sherpa ntuples seem promising. A detailed comparison of the LoopSim 
results to LHC data is however beyond the scope of this Les Houches contribution, though we look 
forward to it being carried out in the near future. 

The ATLAS data taken in 2011 is about a factor of 130 times as large as the data taken in 2010 
(the only published data for W plus jets so far). This will allow a much further reach in all kinematic 



variables. To get an idea, we show in Figure 55 the ratio of the predictions from the Exclusive Sums 
to the respective inclusive NLO predictions for W+ > 1, 2, 3 jets. At an Ht value of 2 TeV, the ratio 
foTW> 1 jet is of the order of 2; the ratio for 1^ > 2 jets rises to about 1.4. The NLO-to-LO K- 
factor for W+ > 1 jet rises rapidly with increasing Ht, while the iC-factor for W+ > 2 jets increases 
only moderately (because no new subprocesses are being introduced). It will be interesting to see if (a) 
the additional factor of 2 (for the W+ > 1-jet case) and (b) the additional factor of approximately 1.4 
(for the W+ > 2 jet case) lead to better agreement with the data. The LHC data from 2011 (and the 
higher statistics expected in 2012) will reach these kinematic values and should shed further light on 
the necessity and the efficacy of this theoretical technique, not only for W+ > 1 jet, but for higher jet 
multiplicities as well. 



^^In this study, the same jet definition (anti-fcr with R = 0.4) was used for both establishing the separation of the fixed 
jet-multiplicity contributions and evaluating the cuts and observables. 
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Fig. 55: The ratio of the predictions obtained from the NLO Exclusive Sums approach to the inclusive 
NLO predictions ior W > 1 jets, W > 2 jets and W > 3 jets. The 'jitter' is due to the limited 
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Appendix: a double logarithmic analysis of the Exclusive Sums method 

To help understand the structure of the Exclusive Sums method, it can be useful to consider how it works 
in a simple double logarithmic approximation. We use pt,min to represent the minimum pt for the jets 
in the Exclusive Sums sample, and first study the cross section for W production as a function of pt^w 
at high pt^w (^ mw), considering in particular the terms that go as a^L?'^ where L = hipt^w /Pt,ram- 
The 0-jet sample does not contribute at all to non-zero pt;w^ so the first term comes from the exclusive 
1-jet contribution. If calculated to all orders in the double logarithmic approximation (DLA), it would 
have the form 



LO/ 



0-l,excl(Pi,W^) = 0-1 {pt,w) exp 



2Ca, 



-L^ 



IT 



(100) 
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where C = 2Cf + Ca for the (dominant) qg — )• W^q' scattering process. The n exclusive jet rate 
would be given by 

and one sees that the sum over all multiplicities is given by 

oo 

^{Pmf^^ = E<excl(Pt,H^) = '^\''iPt,w), (102) 

n=l 

i.e. in the double logarithmic approximation, there are no corrections to the pt^w distribution at high 
Pt,w- Now let us consider what happens if we expand each of the exclusive sums to NLO. For the n-jet 
cross section, we have 

Performing the sum over n, which corresponds to summing an infinite tower of NLO exclusive jet cal- 
culations, leads to 

oo 
n=l 

= cr\%t,w) exp (^^') (l - ^^') (104b) 

= a\^{pt,w) (l - I (^-^^L^) ' + OialL')] . (104c) 



As long as L^ is not large, the difference between this and the correct answer of Eq. ( 102 1 is a straightfor- 
ward NNLO correction, i.e. small. However whenpt VK S> Pt,mm the logarithms become large, the a^L^ 
term can be of order 1 and the Exclusive Sums method may then no longer be a good approximation. 
A similar analysis can be performed for an exclusive sum truncated at some finite order, as used in our 
study. 

Given the above discussion, one may wonder then if there are any circumstances in which the 
Exclusive Sums method will bring benefits. For the observable studied in this contribution, Ht, the 
key difference with respect to pt^w is that it is subject to a 'giant' A'-factor at NLO. This phenomenon is 
associated with 'dijet' topologies in which a soft or collinear W is radiated off the dijet system, leading to 
a double logarithmic (electroweak) enhancement. In addition these topologies can be created by qq type 
scattering (whereas the LO process involves only gq or qcf scattering), leading to further enhancement 
in pp collisions at large Ht- Dijet type topologies contribute significantly to the Ht distribution, even 
when the W is soft, because the variable sums all particles' transverse momenta (whereas the softness of 
the W limits these topologies' contribution to the pt^w distribution). 

Because of the giant K-factor, for the Ht variable the behaviour of the Exclusive Sums method 
is more subtle than ior pt^w- while the crj^{^2 contributions destabilize the prediction for the qg — ;■ Wq' 
type topologies, they instead stabilize the prediction for the much larger qq — )■ Wq'q topologies (present 
only at LO in a NLO W + 1-jet calculation). Going further in the exclusive sum, however, i.e. including 
o"Jy"^3 and o-J^°4 contributions can however destabilize the predictions for both kinds of topologies. 
Traces of this behaviour were visible in the numerical studies shown above. 
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17. W PRODUCTION IN ASSOCIATION WITH MULTIPLE JETS AT THE LHC 

Abstract 

We compare the results from four different theoretical predictions for the pro- 
duction of a VF boson in association with at least two jets at the Large Hadron 
Collider. We discuss a possible method for combining next-to-leading order 
samples with different jet multiplicity from BlackHat-i-Sherpa. We then 
compare these results with the next-to-leading order W plus two jet calcula- 
tion, the leading order ME&TS merged approach of Sherpa and the high- 
energy resummation approach of HIGH ENERGY JETS in an attempt to deter- 
mine if these approaches can be distinguished at the LHC. 

17.1 INTRODUCTION 

The production of a VF boson in association with jets at the Large Hadron Collider (LHC) is an extremely 
important process. It contributes to three distinct areas of the rich physics program at the LHC. Firstly, it 
is a key Standard Model signal and therefore important to test our understanding of the Standard Model in 
the TeV-scale energy range. Secondly, it is an important background in many searches for new physics 
where, for example, new heavy coloured particles have cascade decay chains. Thirdly, it provides an 
ideal testing ground for experimental techniques such as a jet veto: what is learned in the relatively 
well-understood treatment of W plus jets can be directly applied to Higgs searches for example. 

It has been observed that the ratio of M^ + (n + 1 ) -jet events to T^ + n-jet events can be substantially 
larger than one might naively expect by considering the q^ suppression only. This is especially true in 
phase-space regions of large four-momenta, such as the high-ffy tail, because the available phase space 
for extra jet emission at the LHC is extremely large. It can therefore compensate for the effect of an 
additional factor of the strong coupling. This effect is more visible in distributions where additional 
radiation leads to a significant change in the value of the observable, as is the case for the Ht distribution, 
the scalar sum of the transverse momenta of identified leptons, jets and missing energy. The change will 
be more moderate in an observable like Ht,2, whose definition differs from that of Ht by truncating the 
jet sum to include only the two hardest jets in the event. To make an impact here requires the radiation 
to lead to an additional jet with transverse momentum as large as that of the second hardest jet, not only 
larger than the jet pT threshold. The effects will also be smaller in more inclusive variables like the 
transverse momentum of the W boson, pt,w^ or the leading jet, ptji ■ 

There are a number of different theoretical approaches to describing the emission of large numbers 
of jets. In order to probe to what extent the differences in these will be accessible at the LHC, we will 
compare, in this study, the predictions for the jet activity in inclusive W{^ ev) + 2-jet production 
from (a) BlackHat-i-Sherpa {BHS) [TOllIIll^l, (b) combined BHS samples (to be described below), 
(c) Sherpa |[T4i|425l run in ME&TS mode (S-MEPS) EH |427l gH and (d) High Energy Jets 
(HEJ), an all-order resummation of wide-angle radiation Il4181l419ll438ll . 

The current state-of-the-art next-to-leading order (NLO) predictions for W production in associa- 
tion with jets are those of BHS, which have been calculated up to W plus four jets with a leading-colour 
approximation for the virtual part 1221 . and up to W plus three jets with a full color treatment ITTOIISTI . 
In this study, we consider the inclusive W + 2-jet prediction at NLO accuracy, and further, discuss and 
show predictions from an inclusive sample where TV + 2, 3, 4-jet events generated by BHS are combined 
in a simple manner, nevertheless without introducing any double counting of phase-space regions. 

The S-MEPS predictions are obtained from merging at leading order (LO) tree-level Matrix Ele- 
ments ioT W + 0, . . . , n-parton final states with (Truncated) parton Showers (hence the name ME&TS) 
preserving the leading logarithmic accuracy to which soft and coUinear multiple emissions are described 
by the parton shower. The newer ME&TS merging scheme was introduced in Ref. II426II and optimised 
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as documented in Refs. II427I I428ll to improve over the original Sherpa implementation based on the 
CKKW approach 1 14291 1430) 1 . ME&TS guarantees a better matching regarding the usage of scales as oc- 
curring in the evaluation of the matrix elements and those scales driving parton showering. The S-MEPS 
sample used in our study was generated by including W{^ ev) production matrix elements with up to 
five extra partons (massless quarks, u, d, s, c, b, and gluons). 

The HEJ framework is a resummation of the leading logarithmic terms occurrung in pure, or W, 
Z ox H plus, multi-jet production in the limit of large invariant mass between each pair of jets, to all 
orders in a^. This is then matched to tree-level accuracy for final states with two, three or four jets. In 
principle, the HEJ framework can be merged with a parton shower to add the coUinear pieces which are 
not included in the HEJ description {HEJ does include soft emissions down to around 2 GeV). First steps 
in this direction for pure jet production were taken in 143 lil . Here, the HEJ predictions are calculated at 
the parton level. 



In the 17.2 section, we will elaborate on the method (b) for combining NLO samples of different 



jet multiplicities. Then, in the 17.3 section, we will first show explicit results of the sizable impact of 
large multiplicity events by comparing predictions from the combined BHS sample and the S-MEPS 
merged sample. Secondly, we will study variables chosen to probe the differences in the treatment of 
the QCD radiation. We will show and compare the predictions for all four descriptions mentioned above 
focusing on the following observables: 

• the average number of jets as a function of Ht = X^j PTji + PT,e + Pt,u and Ay, the rapidity 
difference between the most forward and most backward jets, and also 

• the ratio of the inclusive 3-jet rate to the inclusive 2-jet rate as a function of Ht and Z^y^J 

We will then discuss the areas of agreement and difference that we find, before we finally conclude in 
the 1 17.41 section. 

17.2 NLO EXCLUSIVE SUMS 

An NLO n-jet prediction contains events with n or n + 1 partons. For observables for which higher 
multiplicites have a significant impact, this limitation can be detrimental. If one has predictions for 
different multiplicities (m, m + 1, . . . , M), one can try to combine them by avoiding double counting by 
requiring that the n-jet prediction is used only to describe n-jet events (except for the highest multiplicity 
where (n+l)-jets configurations are allowed). The total cross section can be rewritten as a decomposition 
based on exclusive (exc) and inclusive (inc) jet bins: 



A/-1 






E <" + < • (105) 



The exclusive-sums procedure describes each jet bin at NLO accuracy, i.e. at 0{a^^^), or, alternatively, 
only the ( Af + l)-th (inclusive) jet bin is predicted with LO precision. We hence note that the combination 
of the terms shown in Eq. ( 105| ) occurs at different orders of the strong coupling. Furthermore, the 



definition of an exclusive n-jet sample requires a detailed treatment of jet vetoing. For these reasons, 
the simple combination procedure is crude and does not increase the formal accuracy of the prediction, 
which is that of NLO of the smallest multiplicity. However, one can hope that the procedure will lead to 
a better prediction in observables where higher multiplicity events dominate. 

More studies are required to understand the uncertainties related to this procedure. One way of 
doing so would be to vary the jet algorithm and/or parameters of the jet algorithm used to separate the 
different NLO predictions into fixed multiplicities sets and test the stability of the prediction]^ This is 
left to a future study. 



^^Given the definition of Ht, we note Ht,2 ~ Ptji + Pt,j2 + PT,e + Vt,u- 

^*In tiiis study we used tiie same jet algoritiim for both defining the partitioning in jet muhiplicities as well as applying cuts 
and determining observables. 
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\Ve\ < 2.5 
M_L,vK > 20 GeV 
\r,j\ < 4.5 



PT,e > 20 GeV 
PT,u > 20 GeV 
PT,j > 25 GeV 



Table 12: Summary of the cuts applied in the analysis. 
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Fig. 56: The average number of jets as a function of pT,w (left) and prji (right). The pt,w plot shows 
the BHS exclusive sums prediction, while the prji plot is obtained from S-MEPS. 



17.3 RESULTS OF THE COMPARISON 

In this section, we compare the results of different theoretical descriptions for W + n-jets production at 
the LHC. The number n can take values from 2 and above, as we will mostly consider inclusive samples. 
The four descriptions, which we will compare here in more detail, are 

• the BHS calculation ofW + 2-jets at NLO, 

• the combined sample oiW + 2, 3, 4-jet events at NLO from BHS, as described in the 17.2 section, 

• the S-MEPS merged W + n-jets sample using LO tree-level matrix elements up to n = 5, and 

• the approach of HEJ. 

Throughout this study, we will consider inclusive samples of W~ boson production in association with 
at least two hard jets identified by the anti-fcy jet algorithm using R = OA. The jets are required to have 
PT,j > 25 GeV. We look only in the W~ 
M±^w is defined as M^^w = ■\/{\pt,i 



e Ue decay channel and use the cuts given in Tab. 



12 



where 



+ \PT,u\)'^ - {PT,e+PT,vY- 

The HEJ predictions use the geometric mean of the jet transverse momenta to determine the renor- 
malisation and factorisation scale, i.e. (W pt,j) ■ This central choice will be varied by a factor of two 
in either direction to provide an envelope (marked by dotted lines in the corresponding figures) around 
the HEJ default prediction. The BHS predictions instead use H'rp/2 as the NLO calculation becomes 
unstable for a scale which is too low. In the S-MEPS calculation, scales are chosen according to the 
default prescription given by ME&TS 114261 . 

The variables Ht,2, Pt,w and prji are less sensitive to the presence of additional radiation than 



Ht, as discussed in the introduction. The plots, which we present in Figs. 56 and 57 address the al- 
ternative question: given a particular value of Ht, -f^r,2 etc. how many jets are typically found in the 
event? 



Figs. 56 and 57 show the stacked results for the average number of jets as a function of pT,w^ PtJi > 
Ht and Ht,2 visualising the contributions from each exclusive 2, 3, 4-jet sample and the inclusive 5-jet 
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Fig. 57: The contribution from different multiplicities to the average number of jets as a function of Ht 
and Ht,2- The upper plots show the BHS exclusive sums prediction, while the lower ones are extracted 
from S-MEPS. 



sample. The left (right) plot in Fig. 56 and the upper (lower) rows of plots in Fig. [57] depict the results 
as obtained from the combined BHS sample (the S-MEPS sample). In all cases the different colours 
correspond to the terms in the numerator of the formula for the average number of jets, 
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where blue, green, red and magenta stand for i = 2,3,4: and i = 5, respectively. The subscript to (N) 
clarifies that we truncate the determination of the average after the fifth jet bin, noting that {N)f^ — )• (N) 
for a sufficiently large number of jet bins. This makes no difference for the BHS predictions employed 
here since the jet multiplicity de facto is limited to five, but it does for the S-MEPS and HEJ computations 
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where events with i > 5 jets do occur. We have defined n^ 
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da'^f'^''"^ /dO where O denotes an 



observable like Ht, or Ay presented later on. Note that in Fig. 57 the 5-jet part contributes to the average 
number of jets with a factor of 5, while the 2-jet part, for example, contributes with a factor of 2 only. 



The layout of Fig. 58 (including the colour coding) is the same as before: here, we however 
display, wrt. n™'^, the relative fractions of the different multiplicities corresponding to the terms in the 
denominator of Eq. ( |106 l. In other words, in Fig. [58] we consider the partitioning of 
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Fig. 58: The fraction of the total rate from different multiplicities as a function of Ht and Ht,2- The 
upper plots show the BHS exclusive sums prediction, while the lower ones are extracted from S-MEPS. 



Although there is just a 30% fraction of inclusive 5-jet events to the total cross section, we observe that 
their contribution to the build-up of {N){Ht) for very large Ht gets close to 50%. Also, for an Ht ~ 
500 GeV, the average number of jets is composed evenly between the 2, 3-jet and 4, 5-jet contributions, 
while the relative fraction of the 2, 3-jet events is nearly 70%. This emphasizes the dominance of multi- 
jet events in forming large Ht values. It also can be seen that for medium Ht values, 400 < Ht < 
700 GeV, all the multiplicities give roughly the same contribution to the variable {N){Ht), while for 
low Ht, the average is primarily described by 2-jet events. 



Going clockwise through Figs. 56 and 57 we see that the average number of jets is indeed sensitive 
to higher multiplicities when considered as a function oipT,w, PtJi and Ht,2, but in all these cases this 
happens to a lesser extent as if considered as a function of Ht- As expected, the dependence is mildest 
for pt,w, the most inclusive observable studied here. We also observe that the jet-bin decomposition 
of PT,ji and Ht,2 turns out very similar. Most strikingly we note the increase in the contribution from 
the highest multiplicity events, the ones containing more or at least five jets. For Ht,2, we furthermore 



display to the right of Fig. 58 the relative fractions as done in the Ht case. Even for largest Ht,2 values. 



the fraction arising from 2, 3-jet events remains close to 65% stressing once more the lower sensitivity 
of Ht,2 versus Ht regarding multiple jet production. 

Finally, we compare the plots from the combined BHS samples in all figures to the correspond- 
ing ones generated with the S-MEPS sample. Interestingly, the outcome looks very similar although 



ME&TS handles the single terms in Eq. ( 105 1 rather differently. They are calculated at least at leading 
(soft/coUinear) logarithmic accuracy improved by LO n-jet effects. Presumably, for the exclusive jet 
bins, this description (which allows a better treatment of jet vetoes) is not too far off the exclusive sums 
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Fig. 59: Average number of jets as a function of Ht (left) and Ay (right) in two BHS descriptions, from 
HEJ and from S-MEPS, the latter using the {N)j definition. The bands shown with dotted lines for the 
HEJ prediction are a result of varying the scale by a factor of 2 in each direction. 



approach, since the unresolved 0{as) corrections are also present in the Sudakov form factors applied 
in the ME&TS approach. Also, the combined BHS samples as well as the S-MEPS sample use the same 
tree-level matrix elements, namely up to VF + 5-parton matrix elements. Clearly, it has to be studied 
further whether this similarity in the results is a coincidence or not. 

It is clear that the impact of the higher multiplicity samples is significant throughout, especially 
in the high Ht tail. This is precisely the region, which would be probed for signs of new physics, and 
therefore it is essential that we fully understand our theoretical descriptions in this region. This is the 
subject of the remainder of this contribution, where we compare all four different methods of modelling 
hard QCD radiation in inclusive W + 2-jet events. 



The left plot of Fig. 59 shows the final comparison plot between the exclusive sums and inclusive 



2-jet BHS results as well as the HEJ and S-MEPS predictions for the average number of jets as a function 
of Ht- The differences in the descriptions are significantly larger than the scale uncertainty band on the 
HEJ prediction. For the W + 2-jet NLO result, the number of jets rises to 2.6 already at Ht = 500 GeV 
but that levels off significantly below the S-MEPS, exclusive BHS sum and HEJ results. The HEJ results 
level off at a higher value of about 3.0, starting to clearly disagree with the exclusive sums and S-MEPS 
predictions above 500 GeV, from where those two curves keep rising to a final level of around 3.7 to 



4.0. The S-MEPS comes in highest at largest Ht, where {N)j is shown, cf. Eq. (106 1, in order to 



determine the average number of jets for this S-MEPS result. The reason for giving slightly higher (N) 
than the exclusive sums lies in the contribution of additional parton-shower jets present in the S-MEPS 
calculation and more accurately accounted for by the use of the {N)^ definition as compared to the earlier 



result based on (A^)5 presented in Fig. 57 to the lower left. 



In the right panel of Fig.|59j we have plotted the average number of jets as a function of the rapidity 
span. Ay, instead of Ht as before. Again the differences are larger than the scale variation shown on the 



HEJ result, but the ordering is different to that of the left plot of Fig. 59 All four descriptions increase 
linearly with Ay but the gradient is steepest for the HEJ predictions where the average rises above 3.0 
for Ay values as large as 6.0. The BHS exclusive sum result is consistently below this, reaching about 
2.8 at Ay = 6.0, and agrees pretty well with the S-MEPS result based on {N)t. The NLO W + 2-jet 
prediction given by BHS is lower still, between 2.4 and 2.5 for Ay ~ 5.0. 

It may seem surprising that on the plot on the left-hand side the exclusive sums and S-MEPS lie 
higher for most of the distribution whereas on the right-hand side these approaches as well as HEJ give 
predicitions that are commensurate. The region of high Ht and that of high Ay however are largely 
distinct as it is very expensive to have both a large rapidity and large pT for the jets. Also while radiating 
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Fig. 60: The ratio of the inclusive 3-jet and 2-jet rates in the inclusive W + 2-jet NLO and exclusive 
sum description of BHS as well as in the S-MEPS and HEJ approaches as a function of Ht (left) and Ay 
(right). Again, the dotted lines indicate the uncertainty band from varying the scale in HEJ by a factor 
of 2 in each direction. 



an additional jet automatically moves an event towards the higher Ht direction, radiating an additional 
jet tends to not change the rapidity difference. So, we expect the higher multiplicies to have a smaller 
effect on the average number of jets as a function of Ay compared to as a function of Ht- This is indeed 
the case in Fig. [59l 

Lastly, in Fig. 60 we plot the ratio of the inclusive 3-jet to the inclusive 2-jet rate as a 
function of Ht (left) and Ay (right), again for all four descriptions used here. The predicted 
{daf''/dHT)/{daf''/dHT) all agree very well below 400 GeV. The fixed order BHS result for W" + 2 



jets is highest for large Ht, however is known to become unreliable here, since the probability that an 
inclusive 2-jet event is at least a 3-jet event turns too large, being in conflict with the expected behaviour 
of an 0{as) correction. The BHS exclusive sums, the S-MEPS and the HEJ results, in this order, level off 
considerably lower with the HEJ fraction staying below 60% to 70%, which leaves the other predictions 
again above the HEJ uncertainty envelope. In contrast, when the same ratio of jet rates is plotted against 
Ay, the HEJ prediction is consistently higher throughout. This again emphasises that differences in the 
descriptions come to light in different kinematic regions. However, in both cases here the magnitude 
of the differences is relatively small and would be rather difficult to distinguish in present experimental 
data. 

17.4 CONCLUSIONS 

We have compared a number of theoretical descriptions of W~ production in association with at least two 
jets. After outlining one possible method of combining NLO calculations of different multiplicities, we 
compared this with a pure NLO calculation of VF + 2-jets production obtained by BlackHat-i-Sherpa, 
a sample of leading-order events merged using the ME&TS method of Sherpa, and the high-energy 
resummation of the HEJ framework. 

We studied the average number of jets and the ratio of the 3-jet and 2-jet inclusive cross sections 
as a function of Ay and of Ht- We find, with these simple cuts, some clear differences in the predictions 
when we study the average number of jets as a function of both Ay and Ht- Smaller differences, which 
would be more difficult to disentangle experimentally, are found when we study the ratio of inclusive 
rates. 

It would be very valuable to have an experimental study, which probed the average number of jets 
in W production in association with at least two jets, to test our different descriptions of these important 
Standard Model processes. 
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18. UNCERTAINTIES IN THE SIMULATION OF W+ JETS - A CASE STUDY [^ 

Abstract 

In this contribution, uncertainties in the simulation of a large variety of observ- 
ables related to the production of W in association with jets at the LHC and 
the Tevatron are discussed. This work aims to 

• serve as a compendium of currently publicly accessible tools in addition 
to the ones presented in a previous publication II439II with a similar topic, 
and to compare their results; 

• discuss the origin and generic size of various uncertainties in the simula- 
tion of perturbative and non-perturbative aspects of this process; 

• trace the interplay of these uncertainties in various stages of the full event 
simulation; 

• hint at those uncertainties in each of the various tools considered here 
which the respective authors find relevant; 

• guide their users in how to assess the related uncertainties in a way the 
authors recommend. 

18.1 Introduction 

The production of VF-bosons in association with jets constitutes an important process at the Tevatron 
and the LHC, for a variety of reasons. First of all, it represents a major background to Standard Model 
signatures such as top-pair and single-top production,and it also plays a role in searches for the Higgs 
boson in the Standard Model. Furthermore, this reaction, together with the fairly similar channel of Z- 
production in association with jets, provides one of the most important backgrounds in those searches for 
new physics where large missing transverse energy and high jet multiplicities characterise the respective 
signal. Thirdly, this process has become a standard reaction for QCD studies at hadron colliders, rang- 
ing from the validation of simulation tools for multijet signatures to measurements related to multiple 
parton scattering. Finally, this process also provides one of the main testbeds for novel techniques in 
the automation of higher-order QCD corrections and their matching or merging with subsequent parton 
showers in the framework of event generators. 

In the spirit of this last point, providing a testbed for the combination of fixed order calculations 
with the parton shower, this process has been analysed in quite some depth in II439II about five years ago. 
A number of reasons provide motivation to update and extend this previous study, namely 

• the LHC being up and running and starting to provide highly precise data such that a proper 
treatment of uncertainties becomes an important issue; 

• major improvements in the ability to calculate higher-order corrections including up to four jets in 
the final state accompanying the W bosons ISTl II17I l22i ; 

• the advent of such next-to leading order calculations - albeit for lower final state multiplicities - 
fully matched to the parton shower ||440[ |404[ l424l ; 

• an improved understanding of the leading order merging prescription for towers of multijet multi- 
plicities with the parton shower 1.426. .441.1 ; 
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• the combination of matching and merging methods 114421 14431 ; 

• and new methods to simulate multijet topologies based on the high-energy limit ll64ll . 

Therefore this study aims at being a first step towards a more complete update of 114391 . with a shift 
in focus towards a discussion of theoretical uncertainties in different approximations, including pertur- 
bative and non-perturbative effects. Apart from tracing the origin and determining the generic size of 
various uncertainties in the theoretical description of various observables related to this process, also the 
interplay of them at various stages of the simulation, from the matrix element to the hadron level will be 
discussed. Consequently, the most important causes for theory uncertainties in various tools are high- 
lighted. Therefore, one of the more practically relevant goals is to also provide methods to reliably and 
robustly estimate such uncertainties for the various tools used in this study, as recommended by authors 
or users. 

The outline is as follows: After briefly presenting the various tools included in this study and 
discussing the way they have been used here in Sec. 18. 2[ example results for them will be presented 



individually, tool by tool in Sec. 18.3 In Sec. 18.4 these results are compared in order to see and quantify 
relative differences. In this endeavour, experimental results have not yet been included. We reserve this 
comparison with relevant data for a later, full-fledged analysis, which will hopefully include even more 
tools. 

18.2 Codes 

In this work a variety of different codes has been employed, which allow to study the process at various 
different stages: 

1. Fixed order matrix elements: 

By now, the description of W boson production in association with jets is possible for up to 4 
additional jets at NLO. Here, results from two NLO codes, GoSam-i-Sherpa |[T2l 14251 1 1461 and 
BlackHat-i-Sherpa ||70l|5Tl|22l, which are either publicly available or provide publicly available 
event files, are presented. The corresponding results therefore are on the matrix element level. 

2. All-order resummed matrix elements: 

Approximations to the partonic matrix elements for the processes of n-jet production, and 
W, Z,H + n-jets, n > 2, was recently calculated to any multiplicity, and including all-order 
resummations for the leading virtual corrections. The all-order scheme 1418114191 . implemented 
in the Hej | 64 1 code, becomes exact in the limit of large invariant mass between each parton (the 
MRK limit of BFKL). The resummation scheme is merged with LO matrix elements (much like 
in MePs, see later). The resummation of Hej can also be interfaced to a parton shower 114311 ; the 
results presented here, however, are on the matrix element level. It should also be stressed that due 
to the nature of the approximation of Hej, the simulation here are relevant for the production of at 
least two jets in addition to the W boson. 

3. Parton showers: 

The pure parton shower code relies on the coUinear approximation to produce additional jets. 
By using a matrix element reweighting, however, in the process of W production, typically one 
additional jet can correctly be described. For this simulation, PYTHIA8 [348] has been used here, 
with results available on the parton shower level, hadron level and hadron level including UE. 

4. LO matrix elements merged with the parton shower (MePs): 

By now, the use of towers of multijet matrix elements with increasing multiplicity merged to 
the parton shower following ideas presented in 114441 14291 14451 14301 is common practise in the 
experimental collaborations. In fact, a first comparison of different codes and implementation has 
been presented a while ago 14391 . Here, three implementations of these ideas are included, namely 
the ones in MadGraph-i-Pythia II1651I4461[T631I1911|4001 . Pythia8-i-ME 1)4411 and Sherpa |[T46ll . 
Here results are available on all levels matrix element level, parton shower level, hadron level. 
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hadron level including UE, and hadron level including UE and QED final state radiation in different 
combinations of codes. 
5. NLO matrix elements matched to the parton shower (NLO(g)PS and MeNloPs): 

In principle two methods by now have been proposed and fully implemented which consistently 
match full NLO calculations to the parton shower, namely Mc@Nlo 1140111 and POWHEG II416I 
14171 . Here the latter is being used, with its implementation in the PowHEG Box |[T3l . and in- 
terfaced to the Pythia pool parton shower in its A;r-ordered version II447II . In addition, a com- 
bination of such matching with the merging methods described in the previous point is avail- 
able II442I 14431 . ranging under the name MeNloPS. In this paper we use an implementation of 
such methods provided in the Sherpa framework. In both cases, results are available on all levels 
matrix element level, parton shower level, hadron level, and hadron level including UE. 

18.21 BlackHat + Sherpa 

The NLO predictions are obtained by combining BlackHat [4J for the virtual part and Sherpa II147I 
1448 II for the real part. It is currently possible to obtain predictions at NLO for a PF-boson in combination 
with up to four jets 11701157112211 . 

The plots have been produced by re-analysing large event files produced by the combination of 
BlackHat and Sherpa. These files contain particle four-momenta as well as the coefficients of all scale 
dependent functions, including the PDFs so that it makes it possible to easily change factorisation and 
renormalisation scales as well as the PDF set. 

We used a common factorisation and renormalisation scale fip = /^i? = H^/2 with H^ = 
J2j Pt + ^^ where the sum runs over all jets and E^ = a/M^ + (p^)^. 

Estimation of uncertainties The estimation of the uncertainties for the NLO calculation obtained with 
BlackHath-Sherpa is obtained by combining in quadrature the pdf uncertainties obtained using the pdf 
error set and the uncertainties obtained by varying the factorisation and renormalisation scales simulta- 
neously by factors of 1/2 and 2. To this error we also add in quadrature the integration error estimate. 
Another way of estimating the uncertainties due to the choice of scales is to compare predictions obtained 
using different choice of basis scales, but this has not been done for this study. 

We used the CTEQ6.6 PDF set. The value of a^ used for this calculation has also been taken as 
that provided with this PDF set. The PDF uncertainties are estimated using the hessian method and PDF 
'error' set provided with the CTEQ6.6 PDF set. 

18.22 GoSam + Sherpa 

GoSam fT7\ is a new framework which allows the automated computation of one-loop scattering ampli- 
tudes for multi-particle processes. The one-loop scattering amplitudes are generated in terms of algebraic 
d-dimensional unintegrated amplitudes, which are obtained via Feynman diagrams. This allows to per- 
form symbolic manipulations of the expressions prior any numerical step. For the reduction, the program 
offers the possibility to use either a d-dimensional extension of the OPP method lll21[|122l [Tl9l. as im- 
plemented in Samurai [6], or tensor reduction as implemented in golem95 [il30J1311l interfaced through 
tensorial reconstruction at the integrand level II124II . 

The GoSam framework can be used to calculate one-loop corrections within both QCD and elec- 
troweak theory. Beyond the Standard Model theories can be interfaced using FeynRules I|137iI or 
LanHEP[136|. 

To produce results for a certain process specified by the user, the program must be fed with an 
"input card" with the details of the process. Alternatively, when interfacing the program with a Monte 
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Carlo (MC) event generator which supports the Binoth-Les-Houches-Accord (BLHA) interface II145I . 
the specific order file produced by the MC event generator can be passed to GoSam. 

The analysis presented here was performed using this latter generation mode and Sherpa 11425 [ 
11461 was chosen as MC event generator. Sherpa provides therefore the matrix elements for the pro- 
duction of W and exactly one jet at the Born-level and the NLO real corrections to it, together with the 
needed subtraction terms and their integrated counter-parts. GoSam provides the NLO virtual-part. The 
generation of the code follows the standards of the BLHA- interface 1 145 1. During the first call of Sherpa 
an "order file" is written by the MC program. This file is read-in by GoSam to produce the code for the 
one-loop evaluation of needed process. If this happens successfully, a contract file with information on 
the different possible subprocesses is produced by GoSam and can be later read by the MC generator to 
recognize the numbering of the different partonic subprocesses. At running time all information between 
GoSam and Sherpa is also passed using the BLHA-interface standards. 

The steering of the event generation and the analysis interface with RiVET II360II is done using 
Sherpa cards. Each curve in the analysis consists of 100 combined runs of 50 million events. The 
renormalisation and factorisation scales are set according to the choice made for this analysis in Les 
Houches to 

fJ-F = ^R = Hrp/2 , 

where iJ^ is defined in the previous section. 

Estimate of uncertainties The estimation of the uncertainties for the NLO calculation obtained with 
GoSam-i-Sherpa is done combining in quadrature the PDF uncertainties with the uncertainty coming 
from the separate variation of factorisation and renormalisation scale by factors of 1/2 and 2. Ideally also 
the integration error should be added in quadrature to the previous estimate, however the MC integration 
error obtained with Rivet at NLO is not reliable because of the incapacity of RiVET to take into account 
properly the correlation between real and subtraction events. For this reason and because of the very 
high statistics of the MC sample, the MC integration error is neglected. To assess the PDF uncertainty 
we compute the envelope of the results obtained using the three different PDF sets CTIO |I2551|, used 
as nominal set, MSTW08 [262] and NNPDF2.1 B12II . The total scale uncertainty is determined by 
adding in quadrature the factorisation and renormalisation scale uncertainties. Each of them is found by 
computing the maximum between the nominal value and the up and down variations. 

18.23 Hej 

The High Energy Jets (Hej) framework 114181 14 1911 provides an alternative description of collider events 
to the standard fixed order calculations (possibly interfaced to a parton shower). Instead, Hej uses ap- 
proximations to the hard scattering matrix element to all orders in a^ which become exact in the High 
Energy limit. The approximation results in sufficiently simple matrix elements, that these can be explic- 
itly regulated, integrated and summed over any (relevant) multiplicity. This results in an explicit all-order 
resummation of the dominant contributions from wide-angle QCD radiation. 

The building blocks of the Hej framework ensure the correct leading logarithmic behaviour in the 
Multi-Regge Kinematic limit (aka. the High Energy Limit) of large invariant mass between all partons, 
for both the real and virtual corrections. The resummed n-jet rate is then further matched to tree-level 
accuracy for events with up to and including four jets, using a merging procedure for the soft radiation. 

This procedure has so far been applied to the production of jets ||64l . W plus jets II449I . Z plus 
jets and Higgs boson plus jets and has currently been implemented in a fully flexible Monte Carlo for the 
first two of these processes. The implementation integrates explicitly over any number of QCD emissions 
from a {W, Z, H+) dijet system, and hence produces event samples for processes with two jets or more. 
Note that one has access to the momenta of all final state particles for every event and it is therefore 
extremely simple to restrict to a subset of the events if required, e.g. 3-jet exclusive events. 
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The Hej resummation includes emissions at large transverse momentum which are increasingly 
important as the centre-of-mass energy of particle collisions increases. Hej is currently the only available 
flexible Monte Carlo generator to obtain leading logarithmic accuracy in the limit of large invariant mass 
between emissions. However, the Hej framework does not include any systematic resummation in the 
coUinear limit. This is included in a parton shower, but a careful merging procedure is required to link 
one with Hej, as there is significant overlap between the soft emissions included in each approach; the 
first steps in this direction have been taken for jet production 1.431.1 and are ongoing. In the current study 
though, only parton level predictions are given. 

Estimate of Uncertainty The Hej framework does not contain any tunable parameters other than the 
choice of renormalisation and factorisation scale (just like any fixed order calculation). In this study, 
in common with other approaches, we choose both of these to be given by the geometric mean of the 
transverse momentum of the jets: 

/ n \ "- 

I^R = fiF = IUpt] ' (18-2-1) 

where the jets are defined according to the relevant cuts in each analysis. This is however only an 
arbitrary choice, as the framework admits any choice for the scale, including Ht, Pt of the hardest jet 
and a fixed scale. For a given scale, as is evaluated according to the relevant PDF. 

In common with standard convention, we calculate the scale variation by changing this scale by a 
factor of two in both directions. In principle, one could also include the PDF uncertainty, but this is not 
done in this study (as the scale uncertainty dominates). As described above, Hej contains matching to 
tree-level accuracy for up to four jets. However, unlike the merging procedure in a showered sample, the 
merging scale here is not a free parameter. There is only one rational choice for the merging scale: the 
minimum px of a jet in the relevant analysis. However, in an inclusive sample with at least two jets, one 
could use as a further estimator of uncertainty the variation obtained when matching to three and four jet 
LO matrix elements. This procedure will be studied in detail in Ref. II449L but in the present study, we 
quote the uncertainty only from the scale variation. 

18.24 MadGraph + Pythia 

MadGraph 111651 14461 11631 I191II is a general purpose leading order matrix element (ME) generator, 
with a broad variety of models available and easily extensible thanks to its modular structure. The event 
generation is performed by the MadEvent component, a tool implementing the Single Diagram En- 
hanced algorithm for multi-channel phase space integration. When a user provided process is specified, 
MadGraph automatically generates the amplitudes for all the relevant subprocesses and produces the 
mappings for the integration over the phase space. This process-dependent information is then used by 
MadEvent, where the process specific code generated allows the user to calculate cross sections and 
to produce unweighted events. Once the parton level events have been generated, a traditional parton 
shower (PS) Monte Carlo library can be run on top of the MadGraph output to describe additional QCD 
radiation, and possibly allow to produce hadron level generated events if a suitable hadronisation model 
is then applied. 

In order to avoid double counting of QCD radiation from the matrix element and the parton 
shower, the MLM matching approach is used in its ktMLM implementation provided by the MadGraph 
team [,191.1 . 

For the present study MadGraph-5.1.1 II191I1 has been used for the matrix element generation, 
while the parton shower and hadronisation has been provided by Pythia 6.4.2.4 II400II . The W + n 
jet process has been simulated up to 4 additional partons. The PDF used in both calculations has been 
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CTEQ6L1, and in the matrix element calculation the strong coupling costant has been setup to be equal 
to the one from the PDF used. The factorisation scale and the hadronisation scale are set to the W 
transverse mass, m±^w The parton level clusterisation scale xqcut has been set to 10 GeV, while the 
ME - PS matching scale qcut has been set to the optimal value of 20 GeV, determined ensuring the 
smoothness of the differential jet rate. 

The Pythia settings have been defined according the so called Tune Z2, an adjustment of Tune 
Zl described in II450II for CTEQ6L1, where the p± cutoff for the multiple parton interactions is set to 
PARP (82) =1 . 8 32 obtained on top of LHC data as far as the underlying event and multiple parton 
interactions are concerned, while the fragmentation parameters are those optimized on LEP data by the 
Professor 1145 111 team. 

Estimate of uncertainties To estimate the uncertainties due to the factorisation scale and the renor- 
malisation scale, which are set to m±^w^ we varied them simultaneously by a factor two. In addition we 
have independently varied by a factor two the ME - PS matching scale. 

While applying these modifications, the total cross-section is kept fixed to the value obtained 
with the default parameters, 27.77 nb. That is because we are only interested in shape variations of the 
distributions, rather than in the total cross-section of the process calculated by MadGraph, which is 
accurate only at the leading order. 

18.25 PowHEG Box + PythiaS 

The PowHEG Box |[T3l is a computer framework to ease the PowHEG 114161 implementation of new 
processes. It only requires as input the individual components of the NLO calculation under consider- 
ation, i.e., the Born process, its virtual radiative corrections and the real emission contributions. Then 
it automatically combines them, canceling the emerging soft and collinear singularities in the Frixione- 
Kunszt-Signer (FKS) subtraction scheme, and produces the required events. The POWHEG Box is also a 
library, where previously implemented processes are available in a common framework. 
For the present study we make use of the M^-i-jet implementation presented in 114401 . 

The produced events are passed to PythiaS [348] through the Les Houches interface f4521 and 
showered with the default tiansverse-momentum ordered shower, vetoing further emissions harder than 
the one already present in the input events. This is achieved by setting the starting scale of the shower as 
the transverse momentum of the hardest emissionr^ 

When multiple partonic interactions (MPI) are turned on, these are allowed to be harder than the 
first POWHEG emission. Indeed, since the VF + 1 jet process is not accounted for in MPI, there is no 
over-counting. 

Eventually, the relevant distributions are evaluated by interfacing the MonteCarlo output to the 



Rivet II360II analysis, for the two given sets of ATLAS and CMS cuts. 

Since we have simulated events starting from a hard process where a VF is produced in association 
with one jet, only observables built from events where at least 1 jet is present will be shown. 

Generation of predictions and estimate of uncertainties Predictions presented here are based on a 
merged sample of 4M W^ + j and 4M W~ + j weighted events, produced with the default POWHEG 
Box choice of parameters. In particular, we have required a minimum cut p-r = 5 GeV on the associated 
jet at the generation level and, in order to enhance the statistical sampling of the high-px tail, we have 
further suppressed the rapidly rising contribution at low jet p^ by the factor p"^/ (ptsupp +Pt)' with 
Pt supp = 100 GeV. The inverse of this factor enters the event weight. 



^^ An extra veto may be required at this stage, due to the different definitions of the transverse momentum used in the 
PoWHEG Box - either for initial or final state radiation - and in PythiaS. 
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We have adopted the PowHEG Box default values for EW parameters, namely 

Mw = 80.398 GeV , Tw = 2.141 GeV , (ffem)"^ = 128.89 , sin^ ^ w = 0.222645 (18.2.2) 
and we have assumed a CKM matrix with a mixing between the first two generations only 

\Vud\ = \Vcs\ = 0.975 , \Vus\ = \Vcd\ = 0.222 , and \Vtb\ = 1 . (18.2.3) 

Finally, we have resctricted the integration region to the interval < M]y < 2221 GeV. 

For the computation of the PowHEG B function, the renormahsation and factorisation scale was 
chosen equal to 

IJ-R = t^F = P±,j , (18.2.4) 

where p±j corresponds to the transverse-momentum of the (single) parton recoiling against the W boson 
in the so-called underlying Born kinematics Ii417i| . We have also run the code using 



fJ^R = f^F = 1/2 (Jm^ +p2^^^^p^.\ , (18.2.5) 



but no relevant differences were observed with respect to the aforementioned choice, being the two scales 
similar for the 14/^ + 1 jet processes at hand. 

The scales entering in the evaluation of parton distribution functions and of the strong coupling in 
the POWHEG Sudakov form factor are chosen to be equal to the transverse momentum of the POWHEG 
hardest emission PT7ll386l . 

Scale-uncertainty bands obtained by varying the factorisation and renormahsation scales entering 
the B function by a factor of two in either directions are used as an estimate of the theoretical error 
associated to higher order missing effects. 

The uncertainty due to the PDF choice was estimated generating events using three different sets 
(CTIO Il255]l . MSTW2008 1261, and NNPDF2.1 EH). The value of the strong couphng constant 
at AIz is consistently read from the PDF table used. The further showering performed by PYTHIA8 is 
instead performed with default PDF and a^ definitions, the difference being beyond the claimed accuracy 
of the calculation. In this study, we have used Pythia8, version 8.153. 

18.26 Pythias 

Pythia8 [348] is the latest incarnation of event generators of the Pythia family. At the heart of the gen- 
erator are parton showers that evolve high-scale processes to the scale of hadronisation, by generating 
splittings with DGLAP splitting kernels. The splitting scales are ordered in relative transverse momen- 
tum 11348 [|447]| . and the phase space is constructed in a dipole-like manner in order to capture soft gluon 
coherence effects 1145 31 . A key point of the evolution of partonic states in PYTHIA8 is that all perturbative 
components are interleaved II3481I4471 I454L i.e. multiple partonic interactions, space-like and time-like 
showers are all generated in one transverse-momentum ordered evolution sequence. This means that due 
to the competition for phase space, all steps in the event generation are correlated. For a detailed discus- 
sion how parameters of the interleaved shower evolution are tuned to collider data, see 114551 . PYTHIA8 
with additional matrix element corrections has so far not been tuned to data. Since in f441], only very 
small differences were seen for LEP between PYTHIA8 with and without matrix element merging, we 
expect only small re-tuning effects in the parameters of the Lund string model 114561 . Similarly, since 
we keep the low-scale modelling of PYTHIA8 largely intact, only small changes in the underlying event 
tuning are expected. We however expect that some re-tuning will be needed for jet shape data. 

It should be noted that PYTHIA8 includes a selection 2 — ;• 1 and 2 — ;■ 2 processes, as well as 
a limited variety of 2 — )• 3 processes, but does not contain a general ME generator. New processes, 
particularly for higher jet multiplicities, have to be made available in form of Les Houches Event (LHE) 
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II452II files. By virtue of matrix element corrections, PythiaS describes the first emission in W + jets 
with the full matrix element probability. When introducing matrix elements with one additional jet within 
matrix element merging, this allows to fully cancel the merging scale dependence for the first emission, 
while small merging scale dependencies enter when including further jets. Current versions of PythiaS 
include a general implementation of the Ckkw-l matrix element merging prescription 114451 . Please 
consult 1144 II for a detailed discussion of the implementation in PythiaS. 

Generation of the predictions To generate predictions with stand-alone PythiaS (i.e. without inclu- 
sion of matrix elements for W production in association with two or more jets), the built-in qq — )• W 
matrix element in PythiaS was used to generate the initial configuration. This was then evolved with 
to the hadronisation scale and the ensemble of partons hadronised using the Lund string model. For this 
study, we use the publicly available Pythia S.157, with CTEQ6L1 parton distribution functions, and 
the associated Tune 4C. Since [441] showed a large dependence of the quality of the matrix element 
merging on whether rapidity-ordered emissions are explicitly forbidden in space-like showers, results 
are presented with and without enforced rapidity ordering. 

The inclusion of matrix elements for additional jets into PythiaS is achieved with Ckkw-l merg- 
ing. All merging tasks are handled internally in Pythia S.157, allowing for a high degree of automation. 
This means that the user only needs to supply 

• Matrix element configurations in form of LHE files. 

• An identifier giving the hard process of interest. 

• A value of the merging scale. Facilities to allow the user to implement a her/his own merging scale 
definition are available. 

For this report, matrix element configurations with additional jets were generated with MadGraph/ 
MadEvent 111631 . and read into PythiaS in form of Les Houches Events. PythiaS then derives all pos- 
sible parton shower histories for an event, probabilistically chooses a history, and uses the reconstructed 
states and splitting scales to perform a re-weighting with Sudakov factors and ag values. This means 
each event will have a weight 
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where pi and xf are the the reconstructed shower splitting scales and momentum fractions of the in- 
coming partons in itz-direction, and Us^^ipi, Pi+i) the parton shower no-emission probability when 
evolving the state S+i from scale pi to pi+i. Osme gives the strong coupling used in the matrix element 
calculation. All reweighting factors are generated dynamically with help of the shower. The interleaved 
evolution of PythiaS is accommodated by consistently including effects of multiple interactions into the 



no-emission probabilities. A detailed description of the formalism is given in 114411 . 

As input for the current analysis, we have produced LHE files for W"*" + jets with up to four 
(three) additional jets at Tevatron (LHC) energies. The renormalisation scale in MadGraph was fixed 
to IjLr = Mz. For hadronic cross sections, CTEQ6L1 parton distributions (as implemented in LHAPDF 



II457I ) have been chosen at a factorisation scale pp = Mw, and the strong coupling in the ME was 
correspondingly fixed to as(Mz) = 0.129783. To regularise QCD divergences and act as a merging 
scale, a cut in 

kl = mill !.mmipl^,,plj),mm{pl,,plj) ^'^'^''^ ^2"^^''^ } with D = 0.4 

and a cut value of k±^rnin = ^ms = 15 GeV has been applied to the matrix element. 
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Merged PythiaS predictions are given for the default settings, i.e. using the parameters of Tune 
4C, for Tune A2 [458 1, and for Tune 4C without enforced rapidity ordering (dubbed Tune X). Again, it 
should be noted that so far, no tuning including additional jets has so far been conducted. 

Estimate of uncertainties To estimate uncertainties of a merged prediction of W+jets, it is interesting 
to study the dependence on the merging scale value. For this, we have generated LHE files with three 
different k±^min = *ms cuts (tus = 15, 30, 45) GeV, and performed Ckkw-l merging on these samples. 
Furthermore, to show the effect of tuning, the tMs = 15-GeV-sample was processed for two adequate 
tunes. Tune 4C and A2. 

Uncertainties related to shower ordering In 114411 . it was shown that restricting shower emissions 
in PythiaS to regions of phase space ordered both in transverse momentum and rapidity leads to non- 
negligible effects in merged predictions. This can be seen as an effect of limiting the shower accuracy by 
reducing the phase space over which splitting kernels are integrated, meaning the accuracy of Sudakov 
form factors is impaired. Loosely speaking, if above the merging scale, the matrix element, integrated 
over the full phase spacep^ differs substantially from the splitting probabilities integrated over the al- 
lowed parton shower phase space, merged results will exhibit substantial merging scale dependencies. 
Such problems are obviously introduced if the parton shower phase space is heavily constrained. 

Changing the phase space regions in which the shower is allowed to radiate thus allows us to esti- 
mate the uncertainties of the merging procedure in conjunction with the underlying shower. Particularly, 
this procedure can test the quality of the matrix element merging beyond the first few emissions, and give 
hints on how the shower resummation may be improved. 

To emphasise the impact of the shower transition probabilities, we choose a fairly small merging 
scale (tus = 15 GeV) to regularise the tree-level matrix elements for this investigation. Then, for each 
matrix element state, we generate all possible parton shower histories for a matrix element state, by 
clustering emissions. This is achieved by inverting the shower momentum- and flavour-mappings. 

When merging matrix elements with rapidity-ordered showers, we investigate two ways of biasing 
the selection of a particular history, from which to generate the necessary Sudakov form factors: 

1. In a "y -blind" sample, we do not include an additional discriminant based on rapidity. This means 
that -just like in the standard case - p-ordered will be preferred over p-unordered ones. 

2. In a "y-conscious" sample, we pick histories with rapidity-unordered splittings only if no rapidity- 
ordered histories were found. Adopting this strict ordering criterion, histories ordered in p and 
rapidity will be chosen predominantly, and only if no such history exists, histories un-ordered in 
either p and/or rapidity are picked. 

It should be noted that to the accuracy of the parton shower, both these prescriptions are equivalent, 
and switching the choice of histories gives a real estimate of the quality of the merging in conjunction the 
underlying shower. We believe that including this uncertainty gives a pessimistic view on how wide the 
range of predictions of one merged calculation can be, indicating that although standard by now, matrix 
element merging in PythiaS should be applied with care. However, with reasonable settings, including 
additional jets can improve the description of multiple hard jets substantially. 

18.27 Sherpa 

Sherpa Il425[|146l is a full-fledged event generator capable of simulating all aspects of particle collisions 
as they occur at particle accelerators such as the Tevatron or the LHC. It includes two independent ma- 
trix element generators, AMEGIC++ II147I and COMix II459I1 . to generate cross sections and distributions 



"" Particularly for high jet multiplicities, it could be imagined that phase space integrators have difficulties to fully sample 
the phase space, especially close to kinematic limits. By full phase space, we mean the region that the phase space generator 
actually filled. 
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for final state multiplicities of up to six to ten particles. In the former one, methods to automatically 
generate dipole subtraction terms in the widely used Catani-Seymour scheme 112361 11501 have been 
incorporated 1.448 1 ; the Sherpa package also supports the BLHA pi451 for the interface to one-loop 
programs such as BlackHat or GoSam. For parton showering, Sherpa employs an algorithm based 
on Catani-Seymour subtraction kernels, proposed in II460II and implemented in the Sherpa framework 
in f^Bl]. For the hadronisation, Sherpa uses either its native hadronisation scheme, based on the cluster 



fragmentation model [.462.1 and its implementation described in 1.463.1 or an interface to Pythia II400I 
providing access to the routines of the Lund string model II456II . Both have been successfully tuned to 
LEP data within the Sherpa framework, with a similar quality in describing the data. The hadron de- 
cays are also fully provided in the Sherpa framework, as well as QED final state radiation to both the 
VF -boson and the hadron decays, simulated using the YFS approach [464. .465.1 . 

In this work, the most recent, publically available Sherpa version, Sherpa- 1.3.1, has been used 
in two ways of running the simulation, namely 

1. in the MePs mode: 

In this method, towers of LO matrix elements with increasing jet multiplicity, in the case at hand 
W, W+l,W+2, ..., M^+nj jets, are merged in the spirit of 114291 1430 1 to yield an inclusive sam- 
ple. In fact, codes relying on such algorithms have been compared in a previous publication [439|, 
which helped to establish and validate the methods and their various implementations. In contrast 
to the original implementation in Sherpa II466I1 . which used analytical forms of Sudakov form 
factors etc., the current version of the method ['4261 directly uses the parton shower for Sudakov 
rejections etc. and is thus closer in spirit to the variant presented in [i445i,:467il for multijet merging. 

2. in the MeNloPS mode: 

This method can be understood as the combination of a matching of the parton shower to a NLO 
matrix element and a merging of additional towers of LO matrix elements with even higher jet 
multiplicities. Thus, in the case at hand, inclusive W production calculated at NLO accuracy is 
merged, as above, with LO matrix elements for W + 1, W + 2, ..., W + nj jets. This method 
has been pioneered in Ii442[ 14431 where the implementation employed within Sherpa has been 
detailed in the second reference. 

The respective settings and relevant details for both simulation modes are described below. 

Sherpa in MePs mode In the MePs mode Sherpa was run with up to n j = 6 jets in the matrix ele- 
ment evaluation including all possible massless (anti-)quark and gluon initial and final states. All matrix 
elements were generated using COMIX. The MEPs-separation parameter was set to Qcut = 20 GeV, for 
its precise definition see 14261. The scales are chosen as 

aJ+"(Meff) = aj(;u) • asip±,i) ■...■ as{p±,n) , (18.2.6) 

wherein the relative transverse momenta p±^i are the nodal values of the final state partons of the W + n 
parton matrix element as obtained from recombining it using the inverted splitting probablities given 
by the parton shower. The core scale fi is then chosen as the partonic centre-of-mass energy of the 
reconstructed core process, i.e. jj? = S2->2 where A; = in the process at hand. In all stricly perturbative 
setups a parton shower cutoff of to = (0.7 GeV)^ has been used. 

The parton shower cutoff and all fragmentation parameters of both the internal cluster hadroni- 
sation and the interfaced Lund string fragmentation models have been tuned to LEP data and give a 
similarly good description. Similarly, the parameters of Sherpa's MPI model have been tuned to Teva- 
tron and LHC data using the CTIO [|255 1 parton density parametrisation. These parameters are given in 



App. 18.71 
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Sherpa in MeNloPs mode In the MeNloPs mode Sherpa is run with essentially the same param- 
eters as in the MePs mode, described in the previous subsection. Hence, nj = 6 and Qcut = 20 GeV. 
To be able to describe the inclusive W production process at NLO accuracy, AMEGIC++ was used for 
all parts of the NLO W production matrix elements (supplemented with a hardcoded one-loop matrix 
element from the internal library) and the LO W + 1 parton matrix element. Consecutively, the scales 
were chosen as above with /c = for all tree-level parts and A; = 1 for the real and virtual corrections 
entering the next-to-leading order correction of the core process. All non-perturbative parameters remain 
unchanged wrt. the MePs mode. 

Estimate of uncertainties In order to estimate the uncertainites of the Sherpa predictions, the follow- 
ing procedures have been applied: 

(A) PDF uncertainties: 

Unlike in the PDF4LHC presciption 114681 . here only the central predictions of the three NLO 
PDFs, CTIO 12551, MSTW2008 ||26l and NNPDF2.1 |[3T2]| are compared to estimate the PDF 
uncertainties. The different parametrisations of PDFs as well as their corresponding value of a^, 
both its value at Mz and its running, enter in the calculation of the matrix elements, the parton 
shower and the underlying event. 

(B) Scale uncertainties: 

In a global manner, all scales, renormalisation and factorisation scales are simultaneously modified 
by the canonical multiplication with 2 and 1/2. This, however, is not only applied to the evaluation 
of the matrix elements but also to that of the parton shower, the hadronisation, the underlying event 
simulation and the hadron decays. Regarding the matrix-element evaluation, the MePs default 
scale choice forms the starting point for the scale variations to be executed. 

(C) Hadronisation uncertainty: 

Here the intrinsic modeling uncertainties are evaluated by changing the hadronisation model op- 
erating on Sherpa's parton shower final states, namely switching from Sherpa's default cluster 
hadronisation to Pythia's string fragmentation. For both schemes, an independently tuned set of 
parameters has been employed to perform the parton-to-hadron transition. 

(D) Underlying event uncertainty: 

To this end the tune of the underlying event based on using the CTIO PDF has been modified 
such that the plateau of the number of charged particles and sum of transverse momenta in the 
transverse region are increased or decreased by 10%. This change in the amount of MPI activity is 
accomphshed by varying the o-nd correction factor (SIGMAJND_FACTOR) by -0.04 or +0.05, 
respectively. 

18.3 Results 

In this section we compile results for the individual codes for a number of representative observables 
at the different levels of the simulation. It should be noted, though, that in all results presented in this 
section PDF uncertainties have been estimated by typically varying only over a few different sets rather 
than employing the full procedure as suggested by the PDF4LHC accord II468I . 

18.31 BlackHat + Sherpa 

The following results have been obtained with BlackHat-i-Sherpa. Uncertainties due to the factorisa- 
tion/renormalisation scale variation and the that due to the PDF uncertainties are shown. The yellow 
band corresponds to the addition in quardature of these two uncertainties and the statistical estimation on 



the integration error. All observables are defined using the ATLAS cuts, cf . App. |18.6 



Fig. [64] displays the inclusive cross section for a W boson in association with n jets, where 
71=1,2,3,4. A NLO computation of W+4 jets also provides a leading order calculation of the W^-i-5 
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jets rate, but since it is not at NLO accuracy we refrain here from including it. 

In all the plots presented in this section the uncertainties are dominated by the uncertainty arising 
from the scale variation (it is not the case when the central scale of the process is chosen close to a local 
maximum, in which case the upper boundary of the scale variation is very close or identical with the 
central value, as can be seen from the plots corresponding to VF+3,4 jets). This is partially due to the fact 
that for the assessment of the PDF uncertainty only error sets have been employed that are closely related 
to the central set. In addition, the functional form of the scale definition as given by the kineamtics of 
the final state has not been changed, but rather the emerging scales fip and hr have been multiplied in 
parallel by factors of 2 and 1/2. 
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BlackHat+Sherpa. matrix element level. ATLAS cuts 



BlackHat+Sherpa, matrix element level, ATLAS cuts 
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Fig. 61: Pseudo-rapidity and transverse momentum distributions for the first jet in inclusive VF + 1 jet 
production (upper panel), for the second jet in inclusive VF + 2 jet production (central panel), anf for the 
third jet in inclusive PV^ + 3 jet production (lower panel). 
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BlackHat+Sherpa. matrix element level, ATLAS cuts 



BlackHat+Slierpa. matrix element level. ATLAS cuts 
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Fig. 62: Pseudo-rapidity and transverse momentum distributions for the fourth jet in 1^+4 jet production. 



BlackHat+Sherpa. matrix element level, ATLAS cuts 



BlackHat+Sherpa, matrix element level. ATLAS ctits 
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Fig. 63: HT distributions for event with at least one (top left), two (top right), three (bottom left) or four 
(bottom right) jets. 
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BlackHat+Sherpa, matrix element level, ATLAS cuts 
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Fig. 64: Inclusive cross section for T^ + n jet production. 



18.32 GoSam + Sherpa 

The setup described in the previous section for the analysis using GoSam+Sherpa gives the following 
theoretical uncertainties. The plots show that in general the scale uncertainties are bigger then the PDF 
uncertainties and that the renormalisation scale dependence is usually bigger then the dependence on 
the factorisation scale. To illustrate the decrease in the scale uncertainty given by the NLO calcula- 
tion we also include the distributions for the pseudo-rapidity and transverse momentum of the second 
hardest jet, which have only tree-level accuracy. All observables shown are defined using the ATLAS 
cuts, cf. App. |18.6[ Note that errors in the 2-jet configuration are increased w.r.t. those provided by 



BlackHat-hSherpa, since here only M^ + 1 jet configurations are dealt with at NLO, and the 2-jet 
configurations therefore are descibed at LO only. 
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Fig. 65: Pseudo-rapidity and transverse momentum distributions for the hardest jet. 
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Fig. 66: Pseudo-rapidity and transverse momentum distributions for the second hardest jet. This distri- 
bution have formally leading order accuracy and have therefore a much larger scale dependence than the 
same distribution for the hardest jet, for which a genuine NLO prediction is available. 
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Fig. 67: 
jet. 



HT distributions (left) and AR between lepton and hardest jet (right) for events with at least one 



18.33 Hej 

This section contains the predictions from the High Energy Jets (Hej) event generator. This gives pre- 
dictions for the production oi aW boson in association with at least two jets. Throughout, we show 
results for CTEQ, MSTW and NNPDF parton distributions. We show a scale uncertainty band only for 
the first of these for clarity. The results for the other two are very similar. The yellow band in the ratio 
panel shows the statistical uncertainty in each case. The scale variation is seen to be dominant over the 
statistical uncertainty and the differences in choice of pdf . All observables are defined using the CMS 



cut definitions, cf . App. |18.6 



As discussed in Sec. 18.23 the resummation contained in the Hej framework is supplemented 
with a merging procedure to ensure tree-level accuracy for events with up to and including four jets. This 
leads to the larger drop from the four jet to the five jet cross section, compared to the drop either from 
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Fig. 68: The HEJ prediction for the distribution of the transverse mass of the W boson (top left) and for 
the angle between the hardest jet and the charged lepton from the decay of the W boson (top right), the 
transverse momentum of the hardest jet (bottom left) and for the Ht distribution (bottom right) in events 
where a W boson was produced in association with at least two jets. 
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Fig. 69: The HEJ prediction for the cross sections of W plus n jets. 
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three-jet to four-jet, or from five -jet to six-jet. Tfiis can be clearly seen in Fig. 69 



18.34 MadGraph + Pythia 

The following results have been obtained with MadGraph-i-Pythia. Uncertainties due to the factorisa- 
tion and renormalisation scale and MePs matching scale are shown for results on hadron level including 
UE and QED final state radiation. A comparison of results on parton shower level, hadron level, hadron 
level including UE, and hadron level including UE and QED final state radiation, is also presented. All 



observables shown are defined using the CMS cuts, cf. App. 18.6 



From these results we can conclude that the largest uncertainty on all observables is due to the 
factorisation and renormalisation scale. In addition to that, a large effect is found by switching off the 
final state QED radiation, while only a small difference is oberved between results at the shower level 
and all other results prior to the QED radiation. 
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Fig. 70: MadGraph+Pythia results for W transverse mass. 
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Fig. 71: MadGraph+Pythia results for AR between lepton and hardest jet. 
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Fig. 72: MadGraph+Pythia results for for px_ of hardest jet (top), number of jets (middle) and Ht of 
events with at least 2 jets (bottom). 
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18.35 PowHEG Box + PythiaS 

In this section we show results obtained by running the PowHEG Box implementation of T^ + 1 jet to- 
gether with PythiaS. In all the following plots of this section, CMS analysis cuts have been enforced, 
see App. 18.6 For this study, in the left panels of Figs. 73]|75 we show uncertainties obtained from vari- 
ations of renormalisation and factorisation scales by a factor of two in either directions and by choosing 
different PDF sets in the computation of the hard scattering. Results are shown at the final level, after 
the shower, the hadronisation and the inclusion of MPI, all performed by PythiaS. In general, we notice 
that the uncertainty due to scale variations is greater than the changes in the results due to different PDF 
choices. 

In the right panels of Figs. 73]|75 we show our results at different stages of the simulation, for a 
fixed PDF set (chosen to be CTIO). The stages considered include from the first emission level up to the 
full showered events in PythiaS, including MPI and also effects due to QED radiation off leptons and 
quarks. Various stages of the simulation have been obtained setting the PythiaS switches as reported in 
Sec. [T03l 

We recall here that results should be considered to be physical only after the the hadron level is 
reached (possibly including MPI and QED effects). In particular, we stress that the results at the parton 
level are obtained considering only the POWHEG first emission, and they are therefore only intermediate: 
indeed at this stage only the hardest radiation has been generated and effects due to further showering 
are not yet taken into account. 

For most of the observables results do not show large variations going from a simulation level to 
another. In particular, for truly NLO predictions such the plots in Fig. 74 or the bin rijct = 1 of Fig. 73 the 



major effects that arise at each successive stage of the simulation are a change in the normalisation, due to 
a slightly different number of events passing the analysis cuts when multiple emissions are allowed, and a 
moderate shape distortion in the low end of the spectrum. Both these effects may be attributed to multiple 
QCD radiation due to Sudakov effects introduced by the parton shower. As expected, these effects are 
of the same size, or smaller, than the theoretical uncertainty due to scale and PDF's variations, when 
propagated to the hadronic level. Similar effects are also observed when the QED radiation is turned on. 
In this case, results are lowered as a consequence of the cuts on the lepton transverse momentum and 
rapidity. 

Due to the requirement of having at least two jets, the remaining observables are predicted only at 
leading order or with leading log accuracy by the POWHEG simulation of VF + 1 jet. This is also reflected 
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Fig. 73: The number of jets, as predicted by POWHEG Box -i- PythiaS. 
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POWHEG BOX + Pythia8 hadron level, with MPI, CMS cuts 
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Fig. 74: The hardest jet transverse momentum distribution (upper plots), the AR separation (middle 
plots) and the invariant mass m (lower plots) of the hardest jets and the hardest lepton, as predicted by 
POWHEG Box + PythiaS. 
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Fig. 75: The transverse momentum p± of the next-to-hardest jet, the scalar sum of the jet transverse 
energy Ht of events with at least 2 jets and the sum of the transverse energies of all the particles in 
events with 2 or more jets, as predicted by POWHEG Box + PythiaS. 
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Fig. 76: Comparison between predictions using different PythiaS tunes, at hadron level with MPI, as 
predicted by POWHEG Box + PythiaS. 



in the larger band associated with the scale variations. 

Observables such as Ht, the scalar sum of the transverse energy of the jets for events with two or 
more jets, show an enhancement in the high-//r tail. This effect mostly arise as a consequence of the 
showering, since the successive stages do not change the predictions any longer. The same behaviour, 
even more enhanced, is also observed in the scalar sum of the transverse energy of all particles, always 
in events with two or more jets. 



In Fig. 76 we instead compare the effect of using different PythiaS tunes on our predictions, 
obtained in this case at the hadron level, including MPI. Essentially all the observables turned out to be 



extremely stable under the variations of the PythiaS tune, as shown in Fig. 76 Major differences only 
appears for the beam thrust, when it is defined at the particle level (see App. 1S.62| ). 



18.36 PythiaS 

For this study, PythiaS has been run stand-alone and including matrix elements with additional jets. 
Note that in PythiaS, multiple interactions are interleaved with space- and time-like showers, meaning 
that in general, MPI and parton showers cannot be disentangled by just switching off secondary scat- 
terings. When referring to "Hadron Level", we mean after the interleaved evolution (including QED 
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Fig. 77: Tuning variations for PythiaS at hadron level. The plots show the iJ^-distribution when requir- 
ing at least two jets (upper left), the px of the hardest jet (upper right), the Z\i?-separation of lepton and 
the hardest jet (lower left), and the number of jets (lower right). The lower insets show the ratio of the 
samples in the upper half to ME3PS (Tune 4C, y-blind treatment). All merged plots are produced with a 
merging scale of tws = 15 GeV. 



splittings), and after hadronisation. For the sake of comparison, "Shower Level" indicates results after 
(interleaved) final- and initial-state radiation, switching multiparton interactions off. All results presented 
in this section are generated with CTEQ6L1 parton distributions for protons colliding at Ecu. = 7000 
GeV. CKKW-L-merged samples include up to three additional jets, taken from MadGraph/MadEvent. 



Fig. 77 exemplifies how changes in the tuning of the event generator can affect the outcome of 
merged calculations in PythiaS. For this, we produce predictions for Tune 4C II455II and Tune A2 II458I . 
In general we observe only modest shape changes of up to about 20% in observables, when comparing the 
two merged predictions, lending confidence to the statement that the tuning did not artificially produce 
hard scale physics. Normalisation changes between 4C and A2 can be explained by a difference in 
Sudakov suppression: Since Tune 4C integrates the splitting kernels over a smaller region of phase 
space, the suppression generated by trial showers is less pronounced. The increase in the number of jets 
in Tune A2 with respect to Tune 4C, after the third jet, is expected, because the generation of the fourth 
jet is handled solely by the parton shower. Since 4C allows less phase space for these emissions by 
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Fig. 78: Variation of the merging scale value for PythiaS at shower level. The plots show the Ht- 
distribution when requiring at least two jets (upper left), the px of the hardest jet (upper right), the 
Z\i?-separation of lepton and the hardest jet (lower left), and the number of jets (lower right). The lower 
insets show the ratio of the samples in the upper half to ME3PS for ^ms = 30 GeV. All plots are generated 
using Tune 4C (y-blind treatment). 

enforcing rapidity ordering, A2 will look harder. It is debatable whether including rapidity ordering into 
the tuning makes the tune mimic hard scale effects. The scales at which the fourth jet is produced are 
certainly close to the scale of (hard) multiple interactions, which is in turn closely connect to soft physics. 
Although the enforced rapidity ordering in Tune 4C might be considered questionable, we here take the 
pragmatic approach of considering the evolution both with and without enforced rapidity ordering. From 
the fact that up to three jets, the merged predictions of Tune 4C and Tune A2 only differ in normalisation, 
we anticipate that the effect of rapidity ordering will be reduced by merging more jets, since then, the 
number of jets above a cut-off will be dictated by the matrix element. 



In Fig. 78 we investigate the impact of changes in the merging scale value. Again, we mainly see 
normalisation changes and only small changes in shape, which in most cases are smaller than changes due 
to different tunes. The /^-separation between lepton and hardest jet Z\i?(lepton,hardest jet) shows signif- 



icant shape changes above tt. This again is an effect of Tune 4C, and is greatly reduced in Tune A2™| as 



For the sake of compactness, merging scale variation plots for Tune A2 could not be included here. We hope the reader is 
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Fig. 79: Variation of the criterion employed to favour "ordered histories" in PythiaS at shower level. 
The plots show the //^-distribution when requiring at least two jets (upper left), the p± of the hardest jet 
(upper right), the Z\i?-separation of lepton and the hardest jet (lower left), and the number of jets (lower 
right). The lower insets show the ratio of the samples in the upper half to ME3PS (Tune 4C, y-blind 
treatment). All merged plots are produced with a merging scale of ^ms = 15 GeV. 

can be inferred from the tune variation. However, even in Tune A2, small shape changes remain, with the 
change becoming less pronounced when comparing two large merging scales. We take this as an indica- 
tion that the shower splitting probability - giving radiative contributions to Z\i?(lepton,hardest jet) > n 
for high tws - and the the matrix element, which fills the same region in for the low merging scale case, 
are indeed different from the second jet on. This also explains the difference between Tune 4C and Tune 
A2, which differ by the phase space regions over which the splitting kernels are integrated. 

Finally, in Fig. [79} we address the interplay of matrix element merging and ordering in the underly- 
ing shower more carefully. The effect of different choices manifests itself again mainly in changes of the 
normalisation of the plots, and is comparable in magnitude to the impact of merging scale variations. At 
first, the changes may seem counter-intuitive, and need clarification. For this, it is important to remember 
the definition of "y-blind" and "y-conscious" in section 18.26 The y-blind treatment will - irrespectively 
of rapidity configurations - mainly choose histories ordered in the shower evolution variable p, and only 



nevertheless willing to consider the following argument - which is only supported by the omitted results. 
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pick p-unordered histories if no otlier ones have been constructed. However, in the y-conscious approach, 
once no history ordered both in rapidity and p is found, one amongst all un-ordered histories is chosen 
probabilistically, irrespectively of the history being y-lp-lov y- and p-unordered. Since the ordering crite- 
rion is stricter, un-ordered histories will be chosen more frequently, meaning that p-unordered ones will 
also contribute more, compared to the y-blind case. Matrix element states with no ordered histories will 
have a number of jets at very similar scales, so that the Sudakov suppression generated by trial showers 
will be smaller. Moreover, for matrix element states in which the last reconstructed splitting is unordered, 
the parton shower will be started at the larger of the unordered scaleq^ which can result in a slightly 
harder spectrum of resolved parton shower jets. Because p-unordered states are picked more often when 
requiring a tighter ordering criterion, this leads in visible differences. The y-conscious method might 
seem somewhat artificial, considering that it introduces a larger dependence on states outside the range 
of even the y-unordered shower variant. Nevertheless, the y-blind and y-conscious prescriptions are 
equivalent to the accuracy of the (y-ordered) shower, so that both should be investigated when assessing 
the quality of the merging. From the visible changes, we can infer that different treatments of formally 
sub-leading effects do matter. For the y-ordered evolution, these are more visible since the accuracy of 
the shower itself is worse, so that the effects of including matrix element states cancel to a lesser degree. 
It is interesting to note that the deviations between the different prescriptions are considerably smaller if 
the merging scale is increased, again hinting at a reduced shower accuracy if the evolution is ordered in 
multiple variables. 



Fig. 79 further shows distributions labelled Tune X, which have been generated by using Tune 
4C, removing the rapidity constraint on space-like emissions, and treating histories y-blind. Results of 
these runs, as expected, closely follow Tune A2. The outcome of both Tune A2 and Tune X differs 
only slightly from the Tune 4C (y-blind) curves, consolidating the conclusion that shifting fractions of 
p-un-ordered histories are responsible for the deviations between the y-blind and y-conscious methods. 
As in the discussion of tuning variation, the similarity in the results of the merged calculation for Tune 
4C and Tune A2 breaks down once we examine jets that are solely produced by the shower, i.e. starting 
from the fourth jet. 

18.37 Sherpa 



As described in Sec. 18.27 Sherpa has been run in two modes for this comparison of LHC predictions. 



The results for the conventional merging of towers of tree-level matrix elements, Sherpa MePs, are 



presented in Sec. 18.37 while the results of its enhancement to NLO accuracy in the core W produc- 



tion process, Sherpa MeNloPs, are displayed in Sec. 18.37 As detailed earlier, all parameters have 
been chosen identically otherwise. The precise requirements regarding the event selection and the def- 
initions of the observables used in this comparsion follow the CMS cut specifications and can be found 



in App.fTS^ 



Sherpa MePs Figs. [80p3] show the results as obtained by running Sherpa in the MePs mode for 



a variety of inclusive and multi-jet observables at different levels of the event generation. All central 
results are displayed together with their respective uncertainties related to the different sources listed in 



Sec. 18.27 The layout in all figures is the same: the upper left and right panels respectively show the 
matrix element level and parton shower level predictions for a given observable. The matrix element level 
is defined as the event generation phase right before the parton showering. For the MePs approach this 
means that modifications necessary for the procedure to work like Us reweighting and Sudakov rejection 
have been already included at this level. The predictions presented in all centre panels were generated 
after enhancing the event generation to include corrections induced by the parton-to-hadron transition and 
decays of the therein produced primordial hadrons. On top of these soft physics effects, one has to also 
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Sherpa MEPS parton shower level. CMS cuts 
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Fig. 80: Sherpa MePs. Uncertainty of the transverse mass of the reconstructed W on the matrix element 
level (upper left), after parton showering (upper right), including hadronisation correction (centre left), 
multiple parton interactions (centre right), and QED corrections (lower left). The lower right panel shows 
the evolution of the central value. 
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Sherpa. MEPS matrix element level, CMS cuts 



Sherpa MEPS partou shower level, CMS cuts 
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Fig. 81: Sherpa MePs. Uncertainty of the angular separation of the charged lepton and the hardest jet 
on the matrix element level (upper left), after parton showering (upper right), including hadronisation 
correction (centre left), multiple parton interactions (centre right), and QED corrections (lower left). The 
lower right panel shows the evolution of the central value. 
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Sherpa. MEPS matrix element level, CMS cuts 



Sherpa MEPS partou shower level. CMS cuts 
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Fig. 82: Sherpa MePs. Uncertainty of the transverse momentum of the hardest jet on the matrix element 
level (upper left), after parton showering (upper right), including hadronisation correction (centre left), 
multiple parton interactions (centre right), and QED corrections (lower left). The lower right panel shows 
the evolution of the central value. 
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Fig. 83: Sherpa MePs. Uncertainty of the inclusive jet multiplicity on the matrix element level (up- 
per left), after parton showering (upper right), including hadronisation correction (centre left), multiple 
parton interactions (centre right), and QED corrections (lower left). The lower right panel shows the 
evolution of the central value. 
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account for multiple parton interactions. The results in the centre right panels of all figures incorporate 
these additional corrections. Finally, all plots to the lower left show the most complete hadron level 
predictions, which were obtained by adding to the event generation QED radiation effects as occurring 
in the decays of the vector boson and the hadrons. To allow a direct comparison of the impact of the 
consecutive event stages, the way the central results change is summarized in all plots to the lower right 



of Figs. 80 83 In these, as in all other panels, the main plots are supplemented by ratio plots stressing 
the magnitude of the differences and uncertainties. Note that the yellow band throughout illustrates the 
statistical uncertainty on the central event sample. 

Apart from the summary plots at the lower right, all other cases depict predictions documenting 
the uncertainty of the central predictions at the different levels of event generation. These uncertainty 



estimates are gained following the procedures outlined in Sec. 18.27 At all event simulation phases, 
the scales are varied as described under this section's point (B). Note that the variation is applied to all 
phases used to make up the respective central (or default) sample, which is taken as the reference under 
all circumstances. For the matrix element level, parton shower level and full hadron level results, PDF 
variations according to point (A) are shown in addition, whereas for the centre panel plots, the focus 
is on the outcomes of the model and tune variations instead, as specified in point (C) and point (D) of 



Sec. 18.27 Notice that the lower left panels also contain the outcomes of scale variations utihzing the 
alternative PDFs mentioned under point (A); they are much alike the ones stemming from the default set. 

As an example for an inclusive observable the transverse mass of the reconstructed W boson 
is shown in Fig. [80] The scale uncertainties amount to ~15% at all generation levels, whereas the 
uncertainties due to the choice of PDF are much smaller. Similarly, the hadronisation uncertainty is 
negligible. The m±^w observable however is more sensitive to the tuning of the MPI model as can be 
seen from the ±10% envelope in the centre right plot of Fig. [80] The uncertainty is of the same order 
as for the scale variations, which generally are more pronounced in the soft region. When considering 
the impact of each perturbative and non-perturbative event stage (see the plot to the lower right), it is 
the MPI corrections that are largest in the region of m±^w < rn^, ranging up to ~30% wrt. the matrix 
element level prediction. They are small above tuw- In this region the dominant effect comes from the 
QED corrections, which themselves are rather small, but they lead to a contamination of the electron 
isolation. The application of the isolation cuts then yields a reduction of the overall normalisation of the 
event sample. Finally there is a small shift towards lower transverse masses, pronouncing the deviation 
in the tail of the distribution somewhat further. 



Fig. 81 and Fig. [82] depict observables that require the presence of at least one jet. In the former 



the geometric separation, AR, between the hardest jet and the electron is shown, while in the latter, focus 
is on the transverse momentum, p± of the hardest jet only. As before the dependence of the predictions 
on PDF and hadronisation model changes remains negligible. While the scale dependence of the AR 
and p± variables increases to ~30%, the uncertainty due to the tuning of the MPI model decreases to 
~5% when compared to the findings concerning the more inclusive observable considered above. In both 
cases the reason for the sensitivity change obviously lies in demanding at least one (hard) jet. The scale 
uncertainties primarily result from changes in the overall cross section. Again, comparing the results of 
the different event stages, one clearly observes the large impact parton showering has on modifying the 
matrix element level predictions. The non-perturbative effects go in the same direction amplifying the 
parton shower effects, but as expected this amplification turns out to be rather mild in the well separated 
and/or hard phase space regions. QED corrections only play a minor role, and are far less important than 
for the m±\Y variable. 



In Fig. 83 one of the simplest examples of a multi-jet observable is presented, namely the distri- 
bution of the inclusive W + n jet cross sections as a function of rijet. Qualitatively, the parameter and 
model dependencies of the predictions are found to behave as for the inclusive one-jet variables. As one 
would expect, the scale uncertainties subsequently increase with the order of the jet bin. The same can be 
noticed for the variation of the PDFs used in the calculation - even though here the effect is considerably 
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smaller. 

Sherpa MeNloPs Following the outline of the previous subsection, Figs. [84p7] compile the results, 
which were obtained by executing Sherpa in the MeNloPs mode, cf. Sec. |18.27[ The presentation is 
based on the same set of figures where the selection of the observables has been taken as in the MePs 
case. Again, all (central) predictions are examined towards their scale, PDF, non-perturbative modeling 
and QED simulation dependence. One small difference has to be pointed out: the plots to the lower left 
now depict exclusively to what extent the additional QED corrections modify the outcomes including 
multiple parton interactions and hadronisation effects. 



Fig. 84 shows the transverse mass of the reconstructed W boson. In the MeNloPS approach, this 
observable is described at NLO accuracy, which leads to a reduction of the associated scale uncertainties. 
The scale variation results for m± w nicely confirm this expectation as can be seen in the upper four plots 



of Fig. 84 The deviations from the central prediction are much smaller than those found for the MePs 
scenario exhibited in Fig.[80| they now are of similar magnitude as the PDF uncertainties. While the scale 
dependence is reduced, PDF and MPI tune variations as well as QED corrections manifest themselves 



as in the MePs case. In particular, the discussion around Fig. 80 explaining the effects of extra QED 



emissions (as being most relevant in the W decay) can be used to understand the findings illustrated in 



the bottom left panel of Fig. 84 



The MeNloPs method primarily improves the precision of the description of the core process, here 
the description of the W production process. One also benefits from improving the overall normalisation. 
However, processes with additional partons in the final state are described in the MeNloPs approach at 
the same level of accuracy as in the MePs approach - in both cases by tree-level matrix elements. Thus, 
the one-jet observables, AR between the lepton and leading jet and the p± of the leading jet, and their 
related uncertainties turn out to be predicted in a very similar manner. This can be clearly observed by 
comparing Figs. 85 86 with Figs. 8T][82 Unlike the findings for m±^\Y, it particularly can be noticed 



that the scale dependence associated with the one-jet observables shown here remains unchanged when 
compared to the respective MePs results. 



Fig. 87 depicts the distribution of the inclusive VF + n jet cross sections as obtained for the 
MeNloPs case. Using the above reasoning, one can understand these results as for the one-jet vari- 
ables. Note that the scale dependence of the zeroth jet bin shows the expected decrease owing to the 
NLO accuracy underlying the description of the core process. 

Fig. [88] finally, highlights the evolution and uncertainties of two definition of the beamthrust, cf . 



App. 18. 6[ a physical observable summing over all final state particles excluding the 1^-constituent lep- 



ton and a pseudo-observable including the W itself. For both observables small perturbative uncertainties 
are completely hurried underneath much larger non-perturbative effects and modelling uncertainties, an 



effect also seen in results from the POWHEG Box-i-Pythia simulation, cf. Fig. 76 This can only be in- 
terpreted as this observable being dominated by non-perturbative effects and in particular the underlying 
event, which somewhat invalidates statements about the merit of this observable in a clean determination 
of initial state radiation effects made in |.469il26l . 
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Sherpa MENLOPS matrix element level, CMS cuts 
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Fig. 84: Sherpa MeNloPs. Uncertainty of the transverse mass of the reconstructed W on the matrix 
element level (upper left), after parton showering (upper right), including hadronisation correction (centre 
left), multiple parton interactions (centre right), and QED corrections (lower left). The lower right panel 
shows the evolution of the central value. 
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Sherpa MENLOPS matrix element level, CMS cuts 



Slierpa MENLOPS parton shower level. CMS cuts 
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Fig. 85: Sherpa MeNloPS. Uncertainty of the angular separation of the charged lepton and the hardest 
jet on the matrix element level (upper left), after parton showering (upper right), including hadronisation 
correction (centre left), multiple parton interactions (centre right), and QED corrections (lower left). The 
lower right panel shows the evolution of the central value. 
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Fig. 86: Sherpa MeNloPs. Uncertainty of the transverse momentum of tlie liardest jet on the matrix 
element level (upper left), after parton showering (upper right), including hadronisation correction (centre 
left), multiple parton interactions (centre right), and QED corrections (lower left). The lower right panel 
shows the evolution of the central value. 
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Fig. 87: Sherpa MeNloPs. Uncertainty of the inclusive jet multiplicity on the matrix element level (up- 
per left), after parton showering (upper right), including hadronisation correction (centre left), multiple 
parton interactions (centre right), and QED corrections (lower left). The lower right panel shows the 
evolution of the central value. 
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Fig. 88: Sherpa MeNloPs. Evolution and uncertainty of two definitions of the beamthrust, calculated 
using all particles not constituting the W (left) and including the W (right). Exemplary, the combined 
PDF and scale uncertainty on the matrix element level prediction (yellow) and the modeling uncertainty 
of the hadron level prediction (blue) are shown. 



18.4 Comparisons 

In this section we compare the results of different tools with each other. While the aim of this study was 
to have a fairly tuned comparison with as many aspects of the calculations as possible being centrally 
defined, there are still important residual differences in the various results. Obviously, the different 
codes produced results at different stages of the simulation, which are not always directly comparable; 
in addition, some of these stages are not very straightforward to obtain: for instance, running PYTHIA8 
without multiple parton interactions included in the interleaved showering obviously changes the overall 
logic of the parton shower model of this code. In addition, other, more obvious differences occur, ranging 
from inconsistent choices of PDFs to different strategies in scale setting procedures. For the case of 
the PDFs, by directly comparing results obtained with BlackHat+Sherpa using CTEQ6.6 and with 
GoSam+Sherpa using CTIO, it appears as if at NLO these differences are minor However, it is not clear 
how much of the differences between MadGraph+Pythia and Pythia8, which both employ CTEQ6L1, 
and the other codes, which employ NLO PDFs, can be attributed to differences in PDFs. 

In addition, results obtained with the NLO codes typically include at least one jet - POWHEG 
BOX+PYTHIA8 and GoSam+Sherpa take H^ + 1 jet at NLO as their core process - while Hej starts at 
W + 2 jets, and BlackHat+Sherpa presents results for up to 4 jets accompanying the W boson in 
different jet bins. Obviously, on the other hand, the multijet merged samples of MadGraph+Pythia, 
Pythia8 MePs and Sherpa include LO matrix elements for up to 3 to 6 jets. 

In the plots in this section each code is shown with a yellow error band, which is the envelope of 



the variations presented in Sec. 18.3 The only exception is BlackHat+Sherpa, which is shown with a 
blue error band. In the ratio plots the codes are plotted relative to BlackHat+Sherpa, also at the parton 
shower level. 



18.41 Inclusive observables 

In this section we present some inclusive observables, which are typically all obtained from codes em- 
ploying multijet merging. By and large, all codes agree in the shapes of the m^ vv distribution at different 
stages, although there are sizable differences in the respective normalisation of the samples. 
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Fig. 89: Transverse mass of the reconstructed W on all levels of the simulation, for the exact defi- 
nition see App. [18.62 and for the cuts employed in the analysis App. 18.61 Note that PYTHIA8 and 
MadGraph+Pythia use the CTEQ6L1 pdf, while Sherpa uses CTIO. 
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18.42 Observables with at least one jet 

As a first and fairly telling observable the p^ -spectrum of the hardest jet is compared, cf. Fig. 
the parton level, the results of the NLO calculations - BlackHat-i-Sherpa and GoSam-i-Sherpa- agree 
nearly perfectly with each other and within about 20% with the multijet merged samples of Sherpa, 
both at LO (Sherpa MePs) and in the MeNloPs (Sherpa MeNloPs) sample. The increase of the 
latter with respect to the former at relatively low transverse momenta of about 50 GeV or below can 
probably be related to the different scale definition in the argument of the strong coupling, where the 
NLO calculations choose /i|, = {H'j,/2) ?» M^/4+p^ ■ while in the Sherpa simulation the transverse 
momentum of the jet has been chosen. Clearly, for small transverse momenta this will lead to visible 
differences. Going from the matrix element to the parton shower level typically leads to the jets becoming 
softer and to losing some of them, due to partons emitted outside the jet and a corresponding energy loss. 
This explains why the Sherpa distribution at the shower level is softer than the NLO result, and thus 
the Sherpa result at the matrix element level, although the size of the difference seems to be larger than 
one would naively expect. This finding is, however, somewhat at odds with the results obtained from 
MadGraph-i-Pythia, which seem to be slightly harder in shape and significantly larger in normalisation. 
The Pythia8 MePs sample, on the other hand, has a smaller one-jet inclusive cross section than Sherpa, 
but the jet spectrum exhibits a somewhat harder tail, corresponding to a shape difference of about 30- 
40% with respect to both the Sherpa results. The same finding, a somewhat harder tail, is also true 
for the POWHEG Box-i-Pythia8 results. The same trends can be also found at the hadron and hadron -i- 
MPI level. For the PowHEG Box result the difference can be attributed to the usage of a scale defined 



at the "underlying-Born" level (cf. Sec. 18.25 for more details). Indeed it has been checked explicitly 
that a NLO computation performed with the same scale choice used in PowHEG Box gives a result in 
complete agreement with the Powheg Box result shown here. Clearly, the differences between different 
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Fig. 90: Transverse momentum of hardest jet on all levels of the simulation, where jets are reconstructed 
using the anti-fci^ with R = 0.4 within |?7| < 4.4 (for exact definitions and cuts see App. 18.61 and 



App. |18.62[ ). Note that BlackHat uses the CTEQ6.6 pdf, PythiaB and MadGraph+Pythia CTEQ6L1 
and all the others use CTIO. In both ratio plots the ratio is taken with respect to BlackHat+Sherpa (on 
matrix element level). 



calculations and codes exhibited here deserve a more in-depth study, which, unfortunately, is beyond the 
scope of this comparison. 

Similar findings are also true for the next observable, the AR distribution between the lepton 



stemming from the W decay and the hardest jet displayed in Fig. 91 Again, the two Sherpa samples 
are compared with the two NLO samples, this time exhibiting a sizable shape difference towards an 
increase at smaller and a decrease at larger distances of about 40% relative cross section. While higher 
jet configurations typically tend to be a bit more central, it seems far-fetched to attribute this difference 
only to them. At the same time, large differences in R are most likely due to jets which are pretty much 
forwarq^ This region of phase space for jet production, however, is known to be quite susceptible to 
mismatches in scale and/or PDF definitions. However, it is worth noting that this difference vanishes 
almost completely at the parton shower level. The PythiaS MePs sample, despite a sizable difference 
in cross section, appears to follow the shape of the NLO and Sherpa results. Further comparing these 
results to those of the other codes at the shower level suggests that the MadGraph-i-Pythia merged 
sample, apart from a drastically enhanced cross section, also shows an enhancement in shape at smaller 
AR < 2 w.r.t. the NLO result. Interestingly enough, the POWHEG Box-i-Pythia8 sample exhibits the 



{Af,r 



Assuming the lepton and the jet to be back-to-back, A(f> — it, one still needs Ai] ~ 4 to obtain (AR)'^ — {A(j>)^ + 
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Fig. 91: AR between hardest lepton and hardest jet on all levels of the simulation (for exact definitions 
and cuts see App. |18.6l] and App. |18.62[ ). Note that BlackHat uses the CTEQ6.6 pdf, PythiaS and 
MadGraph+Pythia CTEQ6L1 and all the others use CTIO. In both ratio plots the ratio is taken with 
respect to BlackHat+Sherpa (on matrix element level). 



opposite behaviour: while the cross section seems fairly consistent with the Sherpa and the NLO ones, 
the shape shows some enhancement of up to 40% at large distance AR, which following the reasoning 
for the jet-px spectrum may also hint at being due to a difference in the definition of scales. As before, 
the same trends visible at the parton shower level can also be found at the hadron and hadron + MPI 
level. 



18.43 Multi-jet observables 



In observables including at least two jets, consider first the case of the Ht distribution depicted in Fig. 92 



Over the full range and obscured by large statistical fluctuations both Sherpa samples seem to follow 
the NLO prediction from BlackHat+Sherpa. The LO result from GoSam+Sherpa, on the other hand, 
appears to fall off at the hard end of the distribution. The prediction from Hej is a bit more subtle to 
judge: at low Ht (around 100 GeV), we see that it is in good agreement with the predictions from the 
other approaches. However, as higher values of Ht are probed, the Hej prediction becomes noticeably 
larger than the fixed-order descriptions, including those from Sherpa where different multiplicities are 
merged. This is the region in Ht where we would expect high multiplicities to have a noticeable effect, 
and therefore where we would expect to see the impact of the resummation in Hej. This is, however, 
slighlty at odds with the fact that the Sherpa prediction included up to 6 jets and that the multijet rates 
and the px distributions of the fifth and sixth jet from Hej undershoot those from Sherpa, cf. Fig. 93 
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Fig. 92: Ht = J2ieUets\ -^-L* °f events with at least 2 jets on all levels of the simulation (for exact 



definitions and cuts see App. |18.61| and App. |18.62| ). Note that BlackHat uses the CTEQ6.6 pdf, 
PythiaS and MadGraph+Pythia CTEQ6L1 and all the others use CTIO. In both ratio plots the ratio 
is taken with respect to BlackHat+Sherpa (on matrix element level). 



and Fig. 95 However, a similar trend concerning the hard tail of this distribution appears also on the 
shower level in the MadGraph+Pythia sample, which includes up to 4 extra jets, and in the POWHEG 
Box+Pythia8 sample, which includes 2 jets at LO and 1 jet at NLO. The trend is even more pronounced 
with an even harder tail for the PythiaS MePs sample, which includes 3 extra jets. At this level, Sherpa 
more or less follows the NLO result. It should be noted, though, that all approaches remain within the 
scale variation band indicated on the BlackHat prediction. This findings are consistently carried over to 
the hadron and hadron+MPI level. 

Turning to the n-jet rates, at the matrix element level, Sherpa follows fairly closely the NLO 
results in different jet multiplicity bins, while Hej seem to overshoot the central value in the 3- and 4-jet 
bin, but staying inside the NLO scale uncertainty band, going back to the tree-level result of Sherpa in 
the 5- and 6-jet bins. As discussed in Sec. 18.33[ the Hej framework includes tree-level matching for 
final states with up to and including four jets in the final state. Therefore it is fair to assume that the 
absence of matching for five jets and above leads to the larger drop in cross section observed in Fig. 93 
from four-jet to five-jet as compared to that from either three-jet to four-jet or from five-jet to six-jet 
and lends support to the suspicion that in Hej a matched sample would also provide larger 5- and 6-jet 
multiplicities. At the shower level, the trend already visible at the Ht distribution repeats itself. The 
smaller cross section in the PythiaS MePs sample is mainly due to the low multiplicity bins, such that 
the shape of the n-jet distribution also has a relatively harder tail than the Sherpa sample. In contrast. 
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Fig. 93: Number of jets on all levels of the simulation (for exact definitions and cuts see App. 18.61 



and App. |18.62| ). Note that POWHEG Box+Pythia8 and GoSam calculate VF + 1 jet on matrix ele- 
ment level, while Hej starts with W + 2 jets and that BlackHat uses the CTEQ6.6 pdf, PYTHIA8 and 
MadGraph+Pythia CTEQ6L1 and all the others use CTIO. In both ratio plots the ratio is taken with 
respect to BlackHat+Sherpa (on matrix element level). 



the PowHEG Box +Pythia8 result, starting consistently at 1 jet, appear to be at the upper end of the 
NLO uncertanities throughout. 



Looking at the correlation of the two leading jets in Fig. 94 at the matrix element only, both 
the AR and the mi2 distribution provided by Sherpa have a slight tilt against the NLO prediction from 
BlackHat+ Sherpa, undershooting the latter result by up to about 40% for large AR and by up to about 
20% for large 77112. While Hej seems to roughly follow the shape of Sherpa for AR, it is significantly 
harder than Sherpa and the NLO result for large values of 7n,i2. In addition, in both cases, Hej also 
predicts a larger cross section that the other tools. 



Fig. 95 shows the transverse momentum distributions for the third to sixth jets ordered in p±, and 
at the matrix element level. For the third hardest jet, the prediction from Hej is similar in shape but higher 
in cross section than the results obtained at NLO from BlackHat+Sherpa or the two Sherpa samples. 
For the fourth jet, the Hej cross section still seems higher than the other ones, but this discrepancy seems 
to be mainly around comparably low jet p±. For larger values of p± all tree-level type or resummed 
predictions aie below the NLO result. Surprisingly, for the fifth and sixth hardest jets, the Hej predictions 
follow the Sherpa ones for low values of p± below about 60 GeV, before they fall off nearly instantly. 
This again may be an artefact of tree-level matching not being included in Hej for the production of five- 
and six-jets or of missing statistical support in this region of phase space. 
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Fig. 94: AR of two leading jets (left) and invariant mass of two hardest jets (right) matrix element level 



(for exact definitions and cuts see App. |18.61] and App. |18.62| ). Note that BlackHat uses the CTEQ6.6 
pdf, PythiaS and MadGraph+Pythia CTEQ6L1 and all the others use CTIO. In both ratio plots the 
ratio is taken with respect to BlackHat+Sherpa (on matrix element level). 
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Fig. 95: Transverse momentum of third to sixth hardest jet on matrix element level (for exact definitions 
and cuts see App. |18.6f] and App. |18.62| ). Note that BlackHat uses the CTEQ6.6 pdf, PythiaS and 
MadGraph+Pythia CTEQ6L1 and all the others use CTIO. In both ratio plots the ratio is taken with 
respect to BlackHat+Sherpa (on matrix element level). 
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18.5 Conclusions 

In this study first steps towards an update and extension of the comparison in 11439 1 have been made. 
In contrast to the older study, a larger variety of tools including fixed-order and resummation tools as 
well as NLO matched and tree-level merged simulations have been included. Not surprisingly, some 
observables appear to be described fairly consistently between different tools, while others exhibit large 
deviations, sometimes clearly beyond the formal accuracy claimed by the different methods, and also 
beyond the best estimates of intrinsic modelling or calculational uncertainties provided by the authors. 
In some instances the relative differences are way beyond naive expectations by most of the authors of 
this study. This clearly hints at the need to carefully cross-validate different tools before deploying them 
for large scale simulations, and it also necessitates an increased collaboration of the authors of such tools 
in order to arrive at a more consistent picture. 

We hope that this study triggered some future work towards the latter goal. 
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18.6 Cuts and observables 

18.61 Cuts 





ATLAS 


CMS 


lepton px 


> 20 GeV 


> 20 GeV 


lepton \ri\ 


< 2.5 (e, fi) 


< 2.5(e), 2.1 (^) 


$± 


> 25 GeV 


no cut 


m±^w 


> 40 GeV 


> 20 GeV 


jetp± 


> 25 GeV 


> 30 GeV 


jet \rj\ 


<4.4 


<2.4 


jet radius 


0.4 (anti-A;_L) 


0.5 (anti-A;_L) 


lepton isolation 


< 10 % of lepton energy 


< 10 % of lepton energy 




in cone with R=0.5 


in cone with R=0.5 



Table 13: Cuts used in this study inspired by common ATLAS and CMS cuts. 

Tab.[T3]presents the cuts applied to define the event selection in both the ATLAS and CMS speci- 
fications. 



18.62 Analysis procedure and definition of observables 

A common analysis was implemented within the RiVET framework and used by all codes providing 
individual events. This analyses is carried out as defined in the following: 
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1. remove all neutrinos from all final states (i.e. 'all particles' from now on means 'all particles 
without neutrinos') 

2. find hardest isolated lepton (electron or muon) ('lepton' from now on means 'hardest isolated 
lepton') 

3. cut on lepton p± and |?7| 

4. compute missing transverse energy $_i : 

(a) sum the three-momenta of all particles within |r/| < 10, this yields —p 

(b) compute missing energy as ^ = \p\ 

(c) assume resulting four- vector p corresponds to neutrino 

5. for ATLAS cut on |;^ 

6. resonstruct W four-momentum as p^ = n'^^pt™ 



/ lepton , 



?-{p1) 



W\2 



7. compute W transverse mass squared as m?^ y^ 

8. cut on W transverse mass 

9. remove lepton from final state 

10. cluster into jets keeping only those passing the p_i_ and \ri\ cuts 

11. compute Ht = Eie{jets} ^-Li 

12. compute beam thrust tb = Z^jeipaiticies} i^i ~ \Pi\) using all visible particles 

It should be noted that this defintion of the W is infra-red safe only for transverse observables. 



18.7 Detailed settings 

18.71 Sherpa 

For this study Sherpa-1.3.1 was used. Except for the underlying event, which was tuned for the 
CTIO [12551 parton distribution functions and whose parameters are given below, all other non- 
perturbative parameters were kept at their default values. The underlying model was tuned for the cluster 
hadronisation. 



K_PERP^EAN_1 


1.17 


K_PERPJVIEAN_2 


1.17 


K_PERP_SIGMA_1 


0.760 


KJ'ERPSIGMA^ 


0.760 


PROFILE_PARAMETERS 


0.576,0.353 


RESCALEXXPONENT 


0.238 


SCALE JVIIN 


2.52 


SIGMAJ4D .FACTOR 


0.465 



18.72 PYTH1A8 

To produce the results, we have used two tunes of PYTHIA8, Tune 4C and Tune A2, both of which use 
CTEQ6L1 parton distributions. Tune 4C is the default tune in PYTHIA8- no additional input settings are 
necessary. For completeness, below we list all parameters that are implicitly set by choosing the default 
Tune 4C. 

PDF:pSet = 8 

SigmaProcess : alphaSvalue = 0.135 
SigmaDiff ractive : dampen = on 
SigmaDif f ractive :maxXB = 65.0 
SigmaDiff ractive :maxAX = 65.0 
SigmaDif f ractive :maxXX = 65.0 
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TimeShower : dampenBeamRecoil = on 
TimeShower :phiPolAsym = on 
SpaceShower : alphaSvalue = 0.137 
SpaceShower : samePTasMPI = false 
SpaceShower :pTORef =2.0 
SpaceShower : ecmRef = 1800.0 
SpaceShower : ecmPow = 0.0 
SpaceShower : rapidityOrder = on 
SpaceShower :phiPolAsym = on 
SpaceShower :phiIntAsym = on 

Multipartonlnteractions : alphaSvalue = 0.135 
Multipartonlnteractions :pTORef = 2.085 
Multipartonlnteractions : ecmRef = 1800. 
Multipartonlnteractions : ecmPow = 0.19 
Multipartonlnteractions :bProf lie = 3 
Multipartonlnteractions : expPow = 2.0 
BeamRemnants :primordialKTsoft = 0.5 
BeamRemnants :primordialKThard = 2.0 
BeamRemnants : half ScaleForKT = 1.0 
BeamRemnants ihalfMassForKT = 1.0 
BeamRemnants : reconnectRange = 1.5 

A detailed discussion of these choices can be found in II455II . All other parameters remain with their 
default values. For our purposes, it might be interesting to remark that the starting value for ac,-evolution 
in time-like splittings is given by 

SpaceShower : alphaSvalue = 0.1383 

To investigate the impact of rapidity ordering in space-like showers, we chose to remove enforced rapidity 
ordering by setting 

SpaceShower : rapidityOrder = off 

If rapidity ordering is enforced in ISR, the question arises how it should be treated when picking histories. 
For this purpose, PythiaS supplies the switch 

Merging :enforceStrongOrde ring 

When switched "on", this parameter will result in picking non-rapidity-ordered histories only if no 
rapidity-ordered paths where found, thus disfavouring non-rapidity-ordered parton shower histories for 
matrix element states. To have a more complete understanding of the impact of tuning, we also changed 
to the recently proposed Tune A2 II458L For this, we have to set 

Tune:pp = 7 

PythiaS will then reset the following parameters: 

PDF:pSet = 8 

SigmaProcess : alphaSvalue = 0.135 
SigmaDif tractive : dampen = on 
SigmaDif tractive :maxXB = 65.0 
SigmaDif tractive :maxAX = 65.0 
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SigmaDif f ractive :maxXX = 65.0 
TimeShower : dampenBeamRecoil = on 
TimeShower :phiPolAsym = on 
SpaceShower : alphaSvalue = 0.137 
SpaceShower : samePTasMPI = false 
SpaceShower :pTORef =2.0 
SpaceShower : ecmRef = 1800.0 
SpaceShower : ecmPow = 0.0 
SpaceShower : rapidityOrder = false 
SpaceShower :phiPolAsym = on 
SpaceShower :phiIntAsym = on 
Multipartonlnteractions : alphaSvalue = 
Multipartonlnteractions :pTORef = 2.18 
Multipartonlnteractions : ecmRef = 1800 
Multipartonlnteractions : ecmPow = 0.22 
Multipartonlnteractions :bProf lie = 4 
Multipartonlnteractions : al = 0.06 
BeamRemnants :primordialKTsoft = 0.5 
BeamRemnants :primordialKThard = 2.0 
BeamRemnants : half ScaleForKT = 1.0 
BeamRemnants : halfMassForKT = 1.0 
BeamRemnants : reconnectRange = 1.55 



0.135 



Apart from not enforcing rapidity ordering in space-like splittings, this tune differs from Tune 4C in that 
the proton size is considered a;— dependent. This is in the spirit of Tune 4CX, which was introduced 
in [470]. In general, since we include matrix element states for two and three jets, we do not apply 
additional matrix element corrections in PythiaS after the first emission, by setting 

SpaceShower :MEafterFirst = off 
TimeShower :MEafterFirst = off 

1 8. 73 PowHEG Box + PythiaS 

For this study we used PowHEG Box revl282 and Pythia 8.153. Except for the specific subprocess 
requested, the parton distribution functions set and the renormalisation/factorisation scale factors chosen, 
all the other parameters were kept fixed below during all the runs. Here is a sample POWHEG Box input 
file: 



! W+ + jet product 
idvecbos 24 
vdecaymode 1 
numevts 4000000 
ihl 1 
ih2 1 

ebeaml 3500d0 
ebeam2 3500d0 
Ihansl 192800 
lhans2 192800 
ncalll 100000 
itmxl 5 
ncall2 250000 



ion paramete 
PDG id of 
decay cha 
number of 
hadron 1 
hadron 2 
energy of 
energy of 
pdf set f 
pdf set f 
number of 
number of 
number of 



vector boson (24: W+, -24: W-) 
nnel (1: electron, 2: muon, 3: tau) 

events to be generated 
(1 for protons, -1 for antiprotons) 
(1 for protons, -1 for antiprotons) 

beam 1 in GeV 

beam 2 in GeV 
or hadron 1 (LHA numbering) 
or hadron 2 (LHA numbering) 

calls for initializing the . . . 

iterations for initializing the . . 

calls for computing the integral . 
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itmx2 4 
foldcsi 1 
foldy 1 
foldphi 1 
nubound 100000 
icsimax 1 
iymax 1 
xupbound 2d0 
renscfact IdO 
facscfact IdO 
withdamp 1 
iseed 12345679 
bornktmin 5d0 
bornsuppfact lOOdO 
withnegweights 1 
runningscale 1 
masswindow_low 1000 
masswindow_high 1000 



number of iterations for computing the . . . 
number of folds on csi integration 
number of folds on y integration 
number of folds on phi integration 
number of bbarra calls to setup norm of ... 
<= 100, number of csi subdivision when ... 
<= 100, number of y subdivision when . . . 
increase upper bound for radiation generation 
(default IdO) ren scale factor: muren = ... 
(default IdO) fac scale factor: mufact = ... 
(default 0, do not use) use Born-zero ... 
initialize random number sequence 
(default OdO) kt min at Born level for . . . 

(default OdO) mass param for Born ... 
(default 0) allows negative weighted ... 
(default 0) ren. and fact, scales set to ... 
restricts phase space to ... 
restricts phase space to ... 



When interfacing to PythiaS we have changed the following settings with respect to PythiaS 
defaults, for the various stages under investigations: 

///Hadron Level w MPI and QED 

BeamRemnants : reconnectRange = 1.50000 

Multiplelnteractions : alphaSvalue = 0.13500 

Multiplelnteractions :bProf lie = 3 

Multiplelnteractions : ecmPow = 

Multiplelnteractions : expPow = 

Multiplelnteractions :pTORef = 

PDF:pSet = 8 

SigmaDif tractive : dampen = on 

SigmaDif tractive :maxAX 

SigmaDif tractive :maxXB 

SigmaDif tractive :maxXX 

SigmaProcess : alphaSvalue = 

SpaceShower :MEafterFirst = 

SpaceShower :MEcorrections 



0.1900 
2.0000 
2.0850 



65.0000 
65.0000 
65.0000 
= 0.13500 
= off 
off 



SpaceShower :pTmaxMatch = 
SpaceShower : rapidityOrder = on 
TimeShower :MEcorrections = off 
TimeShower :MEafterFirst = off 
TimeShower :pTmaxMatch = 

//Hadron Level w MPI (added) 
SpaceShower : QEDshowerByQ = off 
SpaceShower : QEDshowerByL = off 
TimeShower : QEDshowerByQ = off 
TimeShower : QEDshowerByL = off 



//Hadron Level w/o MPI 
PartonLevel :MI = off 



(added) 
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//Shower Level (added) 
HadronLevel : All = off 

//Parton Level (added) 
PartonLevel : ISR = off 
PartonLevel :FSR = off 
PartonLevel : Remnants = on 



and, most important, we have vetoed shower emissions with a transverse momentum greater than the 
value of SCALUP read from the Les Houches event file for the corresponding event. 



18.74 MadGraph + Pythia 

For this study MadGraph/MadEvent 5.1.1.0 and Pythia 6.4.2.4 is used. The LHE files are generated 
for events with a W and up to four additional partons, i.e. for the process: 



pp>w- -> 1-vl ; 1-vl ~j ; 1-vl jj ;l-vl~ jjj 
l-vl~ ; l-vl~j ; l-vl~jj ; 
1-vl -jjj ; l-vl~ jjjj 



1-vl jjjj ; 



The mass of the b quark is set to zero. The strong constant as{M^) is set to 0.1300 both in the matrix 
element calculation and in the proton PDF, that is the CTEQ6L1. 

Pythia is used for the parton shower and the hadronisation with the following parameters modified 
according to tune Z2. 



MSTU 
MSTJ 
PARJ 
MSTP 
MSTP 
MSTP 
MSTP 
PARP 
PARP 
PARP 
MSTP 
PARP 
PARP 
PARP 
PARP 
PARP 
PARP 
MSTP 
PARP 
MSTP 
MSTP 
PMAS 
PMAS 
MSTJ 
MSTP 



21)=1 

22)=2 

71)=10 . 

33)=0 

2)=1 

51)=10042 

52)=2 

82)=1.832 

89)=1800. 

90)=0.275 

95)=6 

77)=1 

78)=0 

80)=0 

83)=0 

84)=0 

62)=1 

91)=1 

93)=10.0 

81)=21 

82)=4 

5,1)=4.8 

6, 1)=172 

1)=1 

61)=1 



016 

538 

1 

356 

651 

025 



Check on possible errors during program execution 

Decay those unstable particles 

for which ctau 10 mm 

no K factors in hard cross sections 

which order running alphaS 

structure function chosen (external PDF CTEQ6L1) 

work with LHAPDF 

pt cutoff for multiparton interactions 

sqrts for which PARP82 is set 

Multiple interactions: rescaling power 

CR (color reconnection parameters) 

CR 

CR 

Prob. colored parton from BBR 

Multiple interactions: matter distribution para.., 

Multiple interactions: matter distribution para.., 

ISR cutoff 

Gaussian primordial kT 

primordial kT-max 

multiple parton interactions 1 is Pythia default 

Defines the multi-parton model 

b quark mass 

t quark mass 

Fragmentation/hadronization on 

Parton showering on 
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For additional studies, we set 

MSTJ{41)=3 ! switch off lepton FSR 

MSTP{81)=20 ! switch off MPI 

MSTJ{1)=0 ! Fragmentation/hadronization off 

to switch off, respectively, final state QED radiation, multi-particle interactions, and hadronisation. 



Party 

EXPERIMENTAL DEFINITIONS AND 
CORRECTIONS 

19. PHOTON ISOLATION AND FRAGMENTATION CONTRIBUTION 
Abstract 

Photon isolation and its link with the fragmentation contribution is explored 
via NLO matrix-element generator and parton-shower Monte-Carlo. 

Firstly the dependence of the inclusive photon and di-photon NLO cross sec- 
tions to the choice of isolation criteria are investigated. The isolation criteria 
used is the discretized version of the Frixione isolation, with parameters cho- 
sen for those most practical at an experimental level. As an extention, a more 
generalized version of the standard Frixione isolation is also studied. The se- 
lection of scale is also investigated in search of the 'saddle point' , which would 
give the optimal scale choice. In addition the choice of jet algorithm is inves- 
tigated for the photon with associated jet cross section. 

Secondly, properties of the fragmentation contribution in parton-shower 
Monte-Carlos are investigated. The distance profile of the photon to the other 
generator level particles in the event is explored in the case of neutral mesons, 
fragmentation photons and direct photons. Next the impact of a "hollow" or 
"crown" isolation criterion, expected to enhance the fragmentation contribu- 
tion, is explored. Then, to complement the NLO inclusive studies, the impact 
of typical Frixione isolation criteria on the fragmentation component are in- 
vestigated in the paiton-shower Monte-Carlos. 

Finally conclusions are made comparing the properties of the fragmentation 
contribution in NLO generators and parton-shower generators. 

19.1 INTRODUCTION 

Experimental measurements of single photons and di-photons require the application of isolation cuts 
to reduce the copious backgrounds arising from jet fragmentation. Such cuts also have the impact of 
reducing the fragmentation contributions of photon production. On the theoretical side, including frag- 
mentation contributions of photon production can greatly increase the complexity of the calculations, 
while the application of appropriate isolation cuts can effectively remove those fragmentation contribu- 
tions. 



■*' Contributed by:N. Chanon, S. Gascon-Shotkin, J. P. Guillet, J. Huston, E. Pilon, M. Schwoerer, M. Stockton, M. Tripiana 
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19.11 Frixione isolation 

In the following we will study the Frixione isolation criterion 1147 II . which was designed to suppress 
the fragmentation contribution. It has been shown to reduce the fragmentation contribution in NLO 
generators |18| . The question is to know whether or not the behavior is still applicable using parton 
shower Monte-Carlo and if it can be used experimentally. 

We consider the following function for the isolation criterion : 



Er{R)<f{R) = e-pT,^- 



1 - cos{R) 
1 — cos{Rq) 



(19.1.1) 



where E^°{R) is the isolation sum of all particles inside a cone of R = ^/A(f^ + Arf around the 
photon, e is the strength or the scale of the isolation criterion, pT^-y is the transverse energy of the photon, 
i?o is the first considered cone and n the power of the isolation criterion. This formula can be altered by 
replacing pT,-y with a fixed threshold. Functional forms f{R) for different e and n are shown Fig. 96 




0.05 0.1 0.15 0.2 0.25 0.3 0.35 0.4 

R 



Fig. 96: Examples of Frixione functional form for different parameters. 



The Frixione isolation is tighter and tighter when decreasing the R cone size. With matrix element 
NLO generators (as in Jetphox [4721 I473II and Diphox II474II '). the only contribution possible at a given 
R y^ Ois the one coming from the suplementary hard jets in the event, while the fragmentation debris are 
emitted colinearly to the photon at i? = (angle information is lost due to the fragmentation function 
which is integrated over the angle). With parton-shower generator, fragmentation photons are emitted 
off quarks at a non-zero angle during the showering process. In the following sections we will study the 
link between isolation and fragmentation in NLO generators, then with parton-shower Monte-Carlo. 

19.12 Experimental complications 

There are various mismatches between isolation cuts applied to theoretical calculations and isolation cuts 
applied to data (or to Monte Carlo). First of all, we wish to apply the isolation cut only to energy related 
to the hard scatter. Experimentally, most of the energy inside an isolation cone is due to the underlying 
event associated with the hard scatter, or the remnants of additional interactions in the same crossing. 
Techniques such as jet area subtraction 11475 [ 14761 can be used to remove an amount of energy from 
the isolation cone roughly equal to the expected contamination from underlying event/pileup, leaving 
only energy related to the hard scatter and specifically to fragmentation processes. Since there is no 
underlying event/pileup in partonic level theory calculations, only the isolation cut needs to be applied 
to the theory. 
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Full details of the use of this correction within the ATLAS collaboration can be found in the 
inclusive cross section measurement f477|. The ATLAS isolation definition uses a cone around the 
cluster of cells that are identified as a photon. These photon cluster cells are not included in the sum, 
so there is first a correction for any leakage of the photon shower into the surrounding cells (typically a 
few percent of the photon px)- The pile-up/underlying event correction is then applied by calculating per 
event the ambient energy from the jet activity in that specific event. This follows the jet area connections 
method mentioned above, where all jets are reconstructed without any minimum momentum threshold. 
The energy density of each jet is calculated and the median density is used for the correction. In 2010 
this typically resulted in a correction of around 900MeV. 

The original Frixione isolation scheme assumed that an isolation cut could be applied continuously 
as a function of R (distance from the photon). Actual detectors have a finite granularity. A solution to 
this was the adoption of a discretized version of Frixione isolation, allowing this granularity to be taken 
into account |[T8l . However, it is not possible to place an isolation cut on the inner-most cone (typically 
R ~ 0.1), because of the presence of the photon itself. While the separation between the fragmentation 
photon and the jet remnants is finite in data (and in Monte Carlo) , fragmentation is treated as a coUinear 
process in partonic cross sections. The inability to apply the isolation cut down to R = 0, results in a 
greatly reduced ability to discriminate against fragmentation processes in the partonic level theory. 

To rectify this the Frixione calculation could be modified into a 'crown' isolation, whereby the last 
cone is missed from the calculation. Unfortunately as most of the radiation is collinear in the fragmen- 
tation events, it is likely to reduce its effect of removing these events. Other studies |478 1 have shown 
that the photon quality cuts applied by the experiments will reduce the fraction of fragmentation photons 
accepted, where substantial fragmentation energy is collinear with the photon. However, the rejection is 
not 100%, so we are still left with a smaller reduction of fragmentation contributions in the partonic level 
theory than are actually (presumably) present in the data. In these proceedings, we will discuss how to 
more properly incorporate the correct level of rejection in the theory. 

19.13 Choice of fragmentation scale 

In addition to the experimental difficulties with applying the isolation criteria there are also difficulties in 
choosing appropriate scales for the theoretical calculation. This is discussed in the following text, along 
with other considerations for applying Frixione isolation at a theoretical level. 

Fragmentation is treated as a collinear process in partonic calculations. In this framework, the orig- 



inal "continuous" Frixione criterion 1147 111 was designed to inhibit the appearance of final state photon- 
parton collinear singularities which otherwise require absorption in a fragmentation function D{z, Mp). 
Thus, cross sections for the production of prompt photons isolated with this criterion involve no frag- 
mentation contribution. Discretized variants of this criterion have been proposed which aim at matching 
better what can be actually implemented experimentally |[T8l . They consist in a limited number of nested 
cones Cj=i^... ^n with respective radii Ri = Rmin < R2 < ■ < Rn = Rmax defined in the azimuthal 
and rapidity differences with respect to the photon direction, and requiring recursively that the accom- 
panying hadronic transverse energies inside every successive cone Cj be less than an ordered sequence 
of maximum values^ i?y° such that < i?^° < • • • < El^"^. However, in contrast with the contin- 
uous criterion, such aiscretized variants still involve a fragmentation contribution, though the latter is 
expected to be small, since the situation in the innermost cone shares some similarity with the standard 
cone criterion. When quantifying the magnitude of the fragmentation contribution with such discretized 
criteria, a potentially tricky issue concerns the fragmentation scale dependence and the "best choice" of 
scale. 

This issue matters for isolation with the standard cone criterion when the radius R of the cone 
is ^ 1 while E1f° is kept fixed. Whereas the natural fragmentation scale Mp in the non-isolated case 



''"'whether the _By ° take fixed values or are functions of the photon's transverse momentum does not matter here. 
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is ~ p^, this choice can lead to very poor theoretical estimates at Next to Leading Order (NLO) in 
perturbative QCD when R <^ 1 II472II . The scale dependence near the choice Mp ~ pj, is then large 
and, worse, the theoretical prediction may eventually exhibit an unphysical violation of unitarity whereby 
the predicted NLO cross section for photons becomes larger than the inclusive one, so that even for only 
moderately small R the reliability of the prediction is questionable. On the other hand, as^L'(z, Mp) ~ 
log{MF/AQCD), with the choice Mp ~ Rpp the fragmentation contribution is suppressed compared 
with Mp ~ p^. The situation is improved regarding both scale dependence and unitarity, although it 
does not solve the problem completely. One actually faces a multiscale problem: Aqcd *^ Rpp *^ Pp, 
and a one-scale compromise is possibly insufficient depending on the kinematical regime explored. The 
atypical choice Mp ~ Rpj, has in principle to be supplemented by a resummation of the logarithmic R 
dependence coming form outside the cone, if at all possible. At leading-log R (LLR) accuracy at least, 
such a resummation is actually feasible, which furthermore allows to solve the apparent puzzle why scale 
choices should be very different in the cases with isolation in a narrow cone vs. broad cone or without 
isolation. 

The concern about the discretized Frixione criteria is that the innermost cone size is quite small. 
The choice for the fragmentation scale Mp shall then arguably be Mp ~ 0{RminPT)- On the other 
hand, as the allowed transverse energy deposit E^"{Rmin) inside this cone is correspondingly small, 
the width of the interval in the fragmentation variable on which the fragmentation function is convoluted 
with the partonic cross section is restricted to a rather narrow range < 1 — z < E^° (Rmin) / Pp ~ 
e{Rmin / Rmax)^ ■ This Icads to a quite suppressed fragmentation contribution. The combination of the 
two effects: a low fragmentation scale and a narrow z-range, is the discrete counterpart of the inhibition 
of fragmentation by the continuous criterion. We may thus expect that the issue of the narrow cone is 
less worrying for the reliability of the NLO calculation in this case than if only R were taken small while 
keeping E}^° fixed. In order to assess the uncertainty on the fragmentation contribution we may perform 
the calculation for the "arguably better" scale Mp ~ RminPT and compare it to the expectedly larger 
result for the standard choice Mp ~ Py/2. 

19.2 ISOLATION FOR INCLUSIVE PHOTONS AND DIPHOTONS AT NLO 

The study at NLO uses the Jetphox generator to calculate the inclusive photon cross section and Diphox 
for the di-photon cross section. Details of how to use the software and to obtain predictions with errors 



can be found in II480II and the selection criteria used are listed in the appendix. Previous results from Les 
Houches fTSl showed that the discretized Frixione isolation criteria did manage to reduce the fragmen- 
tation contribution, here we extend that study in several ways. Firstly the cross section returned has been 
compared to that calculated from using the standard cone isolation, as used in current measurements. A 
generahzed form of the Frixione isolation is discussed, aimed to satisfy both the experimental and the 
theoretical requirements on the isolation cut for different pp regimes. In addition the effects of changing 
the number of cones used in the calculation and of choosing an Ep cut, rather than relating it to the pho- 
ton pp, aie investigated. In addition, further complications to comparing theoretical and experimental 
isolation calculations are discussed. Finally there are further brief studies using Jetphox to look at scale 
and jet algorithm choices. 



19.21 Discretized prescription 



The parameters used to define different selections, according to Eq. 19.1.1 



were: 



a: e=0.05 n=0.2 b: e=l n=0.2 c: e=l n=l d: e=0.5 n=I 
e: e=0.05 n=l /: e=l n=OA g: e=l n=0.5 



""The logarithmic behiaviour holds in when Mp ^ Aqcd- In the non-perturbative regime instead no logarithm is expected 



to develop and one expects rather a power-suppressed behaviour; see ref. 14791 for more details 
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Fig. 97: Left: Fragmentation fraction for the cone and Frixione isolation criteria. Right: The applied Et 
cut on the isolation sum as a function of px for the 0.4 cone or 0. 1 cone in the Frixione criteria. 



where all but the last two were based on the previous study. In all cases Rq was chosen to be 0.4, 
with R being set to either: 0.4, 0.3, 0.2, 0.1 or 0.4, 0.35, 0.3, 0.25, 0.2, 0.15, 0.1. 



The comparison to the cone isolation in Fig. 97 shows that out of the chosen parameters only 1 set 



removes the fragmentation contribution more than what is removed by the cone algorithm, although two 
are lower until high pT- It also shows that criteria b and / are not much better than applying no isolation 
criteria at all. When altering Eq. 19.1.1 to use a fixed Et = 4GeV instead of pT,^ the results are more 



promising but this is because it applies a cut in the 0. 1 cone that is below the experimental accuracy (of 



the order 100 MeV due to detector resolution/noise). Unfortunately Fig. 97 also shows that this is also 
the case for the pT requirements, as case e (the only criteria to perform better than the standard cone) 
also applies a cut that is not viable experimentally in the 0. 1 cone. 

There are some positive outcomes from these studies, firstly the Frixione criteria b and d maybe 
useful criteria to use experimentally as they keep the fragmentation contribution similar and low in all 
bins, which could help with understanding of the systematic errors/correlation between bins. Secondly 
the comparison of the number of cones used in the Frixione criteria resulted in a difference of around 1% 
on the total cross section and almost no effect on the fragmentation fraction. This means that it is fine to 
use the lower number of cones case, and that the discrete Frixione criteria is most likely very similar to 
that of the continuous version. 



19.22 Generalized prescription 

As seen previously, to remove the fragmentation contribution in the theory, a small value of e is needed. 
However, given the effects of finite resolution and granularity on the experimental description of the 
isolation energy, a minimum threshold has to be allowed in the isolation cone, especially at low pT- A 
typical value of 2-4 GeV is used as experimental cut, to optimize the rejection of hadronic background 
coming from the decay of light mesons. Now, at high pT this cut might result too tight, particularly on 
the theoretical side given that an isolation cut much smaller than the photon pT can cause large logs in 
the calculations, this effect was not observed in the previous Les Houches study. 

As a good compromise of these two requirements, it has been proposed 1148 11 to extend the original 
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Fig. 98: Effect of the two terms in the modified Frixione isolation prescription effects of the different 
pieces of the generalized Frixione isolation prescription on the isolation cut as a function of pT (left) and 
the cone radius R (right). 



Frixione prescription (Eq. 19.1.1 1 to a more general form: 



Eif''<({Eo)'' + iE.pT) 



1/fc / 1-cosi? 



1 — COS Ro 



(19.2.1) 



where: 
Rq is the maximum cone size 

Eq is the minimum energy pedestal allowed in a cone of size Rq 
e is the fraction of the photon px allowed in the cone of size Ro 
k determines the shape of the isolation profile in p^ 



n determines the shape of the isolation profile in R (see Fig. 98 right]) 

The extra parameters give enough flexibility to ensure a (finite) tight cut at low pT (~ Eq) and, at 
the same time, a loose cut at high end of the spectrum driven by the photon pT- The k parameter controls 
how quickly/smoothly is the transition from one regime to the other (see Fig. [98|^left]). 

This generalized prescriptiorf^ has been implemented in Jetphox recently and some possible con- 
figurations are explored here. The studied configurations vary e (=0.05,1) and k (=2,5,10), and have a 
fixed value for £^0 = 4GeV (the typical cut applied in ATLAS) and n = 0.5 (given the linear behaviour 
of isolation distribution width observed for direct photons in ATLAS II482II ). 

The high-e configurations (e=l), show a worse performance at removing the fragmentation con- 
tribution with respect to the fixed cone approach and are practically insensitive to the value of k in the 
formula. The remaining fragmentation fraction is ~ 25% at 45 GeV decreasing to 20% in the highest 
Pt bin. On the other hand, as seen in Fig. |99} all the configurations for a low value of e (=0.05) show an 
improvement in fragmentation rejection compared to both the no isolation and fixed cone cases, in the 
whole Pt region (45GeV < px < 600GeV). The pT profile for the smaller cone {R = 0.1) in this case 



(Fig. 99 right]) looks also more promising in terms of its applicability at the experimental level. 



Indeed for Eq = the original Frixione prescription is restored. 
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Fig. 99: Left: Fragmentation fraction for the cone and the (generalized) Frixione isolation criteria. Right: 
The applied isolation cut as a function of pT for 0. 1 cone in the (generalized) Frixione criteria. 



19.23 Continuous and discretized Frixione criteria in di-photon events 

Following the previous studies with inclusive photons, we now consider the production of photon pairs. 
The aims of this study are i) to assess the effect on the magnitude of the fragmentation contribution by 
comparing results from using the continuous Frixione criterion with those using several variants of the 
discretized version, implemented in the NLO programme Diphox, thereby providing a NLO assessment 
of how much fragmentation may be missing in the NNLO calculation of Catani et al. [78] which includes 
no fragmentation and therefore uses the continuous criterion; ii) to probe the dependence of the prediction 
with respect to the fragmentation scale choice. It supplements a similar comparison which had been 
performed for inclusive photon production in ifTSl . 



Fig. |100 provides a comparison of the original continuous criterion to the discretized version of 
the criterion based on four nested cones with respective radii Rmin = 0.1, i?2 = 0.2, i?3 = 0.3 and 
Rmax = 0.4. Four variants of the energy profile EJf°{R) as defined in Eq. 19. 1. 1 have been considered: 
[e,n) = (0.05,0.2), (0.05, 1), (0.5, 1) and (1, 1). Fig. |100| (left) presents the distribution in invariant 
mass of photon pairs in the range 40 GeV < m^^ < 300 GeV. The discretized criterion (0.05, 1) 
suppresses fragmentation so much that there is practically no difference between the discretized and 
continuous versions. With the criterion (1,1), the discretized version leads to a distribution 0(10 — 12%) 
larger than the continuous one. The choice (0.5, 1) displays a similar feature, though quantitatively less 
important. The energy profile of the fourth choice is not suited for an efficient isolation unless e is chosen 



very small. A similar comparison is shown on Fig. 100 (right) for the distribution in the difference in 
azimuthal angle A(p between the two photons. Whereas the distribution in invariant mass is dominated 
by the direct contribution, the tail of the distribution in Z\0 tail at low A(p is more sensitive to the 
fragmentation contribution. Therefore, the conclusions are qualitatively similar to the one drawn for the 
distribution in invariant mass, yet the effects are quantitatively larger. 



Fig. 101 assesses the dependence on the fragmentation scale Alp, for the distributions in invariant 

7I _^7 2-| 



mass (left) and in Acj) (right) respectively. Two choices were considered: Mp = Rmin min {pj , pj, } 



7I 72 



O.lminiff^ , pj. } vs. Mp = R„ 



: min {p^' 



„7 2i 



7I 72 



Pp } = 0.4 min {pj. , pj^ } closer to a standard 



As expected, the distribution in invariant mass, which is not very sensitive to the fragmen 

''^Fragmentation scale options depending only on the pr of the photon from fragmentation are not available because Diphox 
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Fig. 101: Fragmentation scale dependence of the distribution in rriyy (left) and Acf) (right) of photon 
pairs, for four variants of the discretized Frixione criterion. 
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tation contribution, is practically not impacted by the choice. The distribution in Acj) is more sensitive to 
the fragmentation component and the sensitivity to the fragmentation scale choice is larger than for the 
distribution in invariant mass. The sensitivity to the fragmentation scale choice is the largest in the case 
of the criterion (e, n) = (1, 1), for which the predictions are 5-7% smaller, rather uniformly, with the 
lower scale choice than with the more standard one. 

In conclusion this preliminary study shows that the impact of the fragmentation contribution pass- 
ing the discretized criterion seems to be almost negligible on the distribution in invariant mass, and 
remains small even on the tail of the distribution in azimuthal angle. Notwithstanding, the conclusions 
shall have limited use depending on how isolation is actually implemented experimentally in the inner- 
most cone. We here stick to a discretized version of the Frixione criterion which respects the original idea 
of a transverse energy deposit decreasing towards zero with the cone radius. If instead any experimental 
constraint would allow a more permissive condition in the innermost cone, a dedicated study would be 
mandatory. 

19.24 Additional studies at NLO 

In addition to the isolation studies with Jetphox, we present here two brief studies as an attempt to 
reduce the theoretical errors from the NLO calculation. These study the choice of renormalizaion and 
factorization scale parameter and secondly the jet algorithm parameters. 

As studied in II480L the scale choice is set to a fraction of the photon px- By altering this fraction 
around the central value of 1.0, it is hoped to gain an uncertainty on the terms missed in the NLO 
calculation. The best selection for this central value would be to be at a 'saddle point', where moving 
in any direction from this point gives similar changes in the cross section. However, it is found that as 
the scale is reduced (in steps: 2.0, 1.0, 0.5, 0.25, 0.1, 0.05 and 0.01) the cross section increases, when 
moving the two scales coherently or independantly. One difference in this result to the previous study 
was that it was carried out in three pT bins, but the result remained the same for all (only the highest bin 
was able to be calculated with a scale of 0.01). Similarly the addition of using Frixione isolation instead 
of the standard cone isolation also resulted in the same cross section behaviour. The summary of this 
is that there must be large contributions needed from NNLO. However, on the positive side, in all 3 px 
bins, the variation between 0.5-1.0-2.0 resulted in differences of similar magnitude around 1.0, so this is 
likely a safe estimate of the uncertainty. 

After the inclusive photon measurements, the next step experimentally is to require the addition 
of at least one jet. Using a jet of lOGeV the cross section was calculated for two algorithms each for 
multiple sizes: 

• Kt algorithm with AR = 0.3, 0.4, 0.5 or 0.6 

• Cone with AR = 0.4, 0.5 or 0.6 

These choices had an affect of < 1% on the cross section computed in 3 photon pT bins, suggesting that 
this will not increase the error for the NLO calculation when moving from the inclusive cross section to 
that with an additional jet. 

19.3 FRAGMENTATION PHOTONS IN PARTON-SHOWER MONTE-CARLO 

The second part of this study continues to investigate photon isolation, but now in di-photon events using 
parton-shower Monte-Carlo generators; again the selection used is listed in the appendix. The study 
begins by investigating the distance between the photon and other particles. It then moves into studying 
several different styles of isolation criteria, including Frixione criteria as done in the inclusive NLO 
studies. 



encodes the two photons in a symmetrized way. 
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19.31 Topology of fragmentation photons 

We consider three sets of parton-shower Monte-Carlo samples for the 77+X process: 

• Pythia |4001 77 Bom and Box direct processes, plus the Pythia 7+jet process with the jet frag- 
menting into a photon (20 million events were generated for the 7-1-jet sample and 1000 times more 
would have been needed for the dijet fragmenting to two photons due to the low (? — )• 7 branching 
ratio for isolated photons). 

• Pythia 77 Born and Box direct processes, plus the Pythia 7-Hjet process with the jet fragmenting 
into a photon and the Pythia dijet process with the two jets fragmenting into photons. Both Pythia 
7H-jet and dijet samples were generated with a filter which enhances the presence of events with 
isolated electromagnetic particles. 

• Madgraph II 1651 77 -1- up to two supplementary hard jets, with fragmentation/hadronization done 
with Pythia. 

The fragmentation contribution is included as a bremsstrahlung contribution in Madgraph at ma- 
trix element level, while it is included as a showering contribution in Pythia 7-f-jet and dijet (in the 
PYTHIA samples we identify fragmentation photons as those having a quark or gluoi^jas parent). The 
fragmentation fraction found is compatible with ref II483II . We consider additionally the case where the 
two jets fragment into boosted neutral mesons (vr^, r/, p and uj) that can experimentally mimic direct 
or fragmentation photons at reconstructed level because of the finite granularity of the detector. These 
samples include an underlying event but were generated without pile-up. 




0.4 0.5 0.6 

AR(Y,gencand) 



Fig. 102: AR distance distribution between the photon and the other generator-level particle candidates 
in the event, for neutral mesons, fragmentation photons and partonic photons. 



Fig. 102 shows the AR distance between the photon or neutral mesons and the other particle 
candidates in the event. Partonic photons, fragmentation photons and neutral mesons have different 
properties as a function of AR. Partonic photons in Pythia have a linear behavior, which is expected 
because the only contribution that can enter in the isolation sum is the underlying event and pile-up (with 
also a small contribution from QCD radiation at the shower level) which is expected to be uniform in 
space. As each bin consists of an annulus with radius growing linearly as a function of R, the quantity 



Photon radiation directly by a gluon is of course not piiysically possible; the representation as such in PYTHIA is a 
technical shortcut for the actual physical process 
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of particles grows linearly with R in the area of the annulus. Neutral mesons have a radically different 
profile, with a peak of the AR distribution close to 0. The peak is caused by the decay of particles 
resulting from jet fragmentation close to the neutral meson direction. Pythia fragmentation photons 
have a behavior somehow in between that of neutral mesons and partonic photons. The peak at low 
AR is still present but much reduced with respect to that of neutral mesons. Madgraph partonic photons 
exhibit a modulation of the Pythia partonic photon AR distribution, probably because Madgraph includes 
fragmentation as a bremsstrahlung contribution. 

From this we can expect that the smaller the AR cone used in Frixione isolation (until AR ~ 0.1), 
the higher the discrimination against the neutral mesons and fragmentation photons. The discrimination 
against neutral mesons is higher than that against fragmentation photons (as is well-known experimen- 



tally). This can be seen in Fig. 103 which shows the isolation sum profile divided by the transverse 
energy of the photon for different cone sizes. 
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Fig. 103: Isolation sum normalized to the photon energy computed in cones of size AR < 0.1 (top 
left), AR < 0.2 (top right), AR < 0.3 (bottom left), AR < 0.4 (bottom right), for neutral mesons, 
fragmentation photons and partonic photons. 
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Table 14: Fraction represented by the 1 -fragmentation and 2-fragmentation contributions for various 
Frixione isolation criteria in Pythia two-prompt photon samples (with electromagnetic enrichment filter). 



Criteria 


1-frag fraction 


2-frag fraction 


1, 2-frag fraction 


Solid AR < 0.4, E'^° < 5 GeV 


0.335 


0.157 


0.492 


Hollow 0.1<AR< 0.4, E}f° < 4 GeV 


0.337 


0.168 


0.505 


Frixione 8 cones, Et = 20 GeV, e = 0.05, n = 1.0 


0.322 


0.145 


0.467 


Frixione 8 cones, Et = 20 GeV, e = 0.05, n = 0.2 


0.318 


0.147 


0.466 


Frixione 8 cones, Et = 20 GeV, e = 1.0, n = 0.2 


0.372 


0.228 


0.599 


Frixione 8 cones, Et = 20 GeV, e = 1.0, n = 0.1 


0.374 


0.232 


0.601 


Frixione 8 cones, Et = 20 GeV, e = 1.0, n = 1.0 


0.353 


0.192 


0.545 


Frixione 8 cones, Et = 20 GeV, e = 1.0, n = 0.5 


0.365 


0.212 


0.577 


Frixione 8 cones, Et = 20 GeV, e = 0.5, n = 1.0 


0.343 


0.176 


0.518 



19.32 Impact of hollow cones on the fragmentation contribution 

In NLO generators, the products of the quark fragmentation are along the fragmentation photon direc- 
tion. In parton-shower generators we have seen that this is not necessarily true. In NLO generators, 
the "hollow" or "crown" isolation, where the energy sum has to be below a fixed threshold in a region 
Ri < AR < i?2 while in the region R < Ri any arbitrary amount of energy is admitted, has been 
shown to enhance the fragmentation contribution with respect to the usual "solid" isolation. This "hol- 
low" isolation is interesting also because this criterion is closer than the "solid" cone to what is used 
experimentally (it allows to exclude from the isolation sum the energy deposited by the photon itself). 



The first two lines of Tables 14 and 15 show that in general the fragmentation fraction does not increase 
significantly when moving from solid to hollow cone isolation, for the PYTHIA samples. 



19.33 Impact of Frixione isolation on the fragmentation contribution 



Tables 14 and 15 report the fragmentation fraction inside the Pythia two-prompt sample for different 
Frixione isolation criteria. Eight isolation cones were used : AR < 0.05, AR < 0.1, AR < 0.15, 
AR < 0.2, AR < 0.25, AR < 0.3, AR < 0.35, AR < 0.4. The results are almost identical 
if instead four cones are used (0.1, 0.2, 0.3, 0.4), as found at NLO. The tables show that in both the 
electromagnetically-enriched samples and non-enriched samples, the discrete Frixione isolation with the 
usual functional form f{R) does not reduce the fragmentation contribution with respect to the standard 
isolation criterion (with a cone AR < 0.4) except when the parameter e is at its smallest value, e = 
0.05, for which modest reductions of between 5 and 8% can be achieved. The cause of this apparent 
non-optimal behavior can be explained by the non-coUinearity of the fragmentation debris around the 
fragmentation photon in PYTHIA. Frixione isolation is designed to apply tighter and tighter isolation 
criteria E^° < f{R) — )• as AR — )• 0, assuming that most of the fragmentation debris are around 
AR ~ 0. As it is seemingly not the case in the parton-shower Monte-Carlo studied here, the criterion 
loses most of its discrimination power. 

The previous study suggests that the previous working points studied with the Frixione functional 



form f{R) might not be optimal for the rejection of fragmentation debris. In figure 104 we compare the 
performance of three different sets of criteria: 1) non-Frixione isolation in a single cone AR < 0.4, 2) 
optimized working points for the parameters in the Frixione functional form (four cones 0. 1, 0.2, 0.3, 0.4 
were used to make the algorithm converge faster), 3) re-optimized 'Frixione' isolation criteria on cones 
AR < 0.1,0.2,0.3,0.4 without using the explicit functional form (we no longer constrain the events 
to satisfy E^° < f{R) and let f{R) free). In the second case, an optimization procedure is performed 
scanning over the parameters e and n to find the best working points (corresponding to a maximum 
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Table 15: Fraction represented by the 1 -fragmentation contribution for various Frixione isolation criteria 
in Pythia two-prompt photon samples (without enrichment filter). 



Criteria 


1-frag fraction 


Sohd AR < 0.4, E'^° < 5 GeV 


0.455 


Hollow 0.1<AR< 0.4, E'r^° < 4 GeV 


0.458 


Frixione 8 cones, Et = 20 GeV, e = 0.05, n = 1.0 


0.420 


Frixione 8 cones, Et = 20 GeV, e = 0.05, n = 0.2 


0.419 


Frixione 8 cones, Et = 20 GeV, e = 1.0, n = 0.2 


0.514 


Frixione 8 cones, Et = 20 GeV, e = 1.0, n = 0.1 


0.519 


Frixione 8 cones, Et = 20 GeV, e = 1.0, n = 1.0 


0.489 


Frixione 8 cones, Et = 20 GeV, e = 1.0, n = 0.5 


0.503 


Frixione 8 cones, Et = 20 GeV, e = 0.5, n = 1.0 


0.465 



efficiency for a given s/b). In the last case, an optimization code is used to find the best selection 
criteria to be applied on E^° for each AR cone. The optimization takes as input the target value of s/b 
(partonic signal over fragmentation background ratio), then relaxes and tightens each cut separately with 
an iterative procedure to find the best signal efficiency for this s/b target. The procedure was performed 
to find the working points corresponding to the s/b obtained with the first Frixione criterion. 



Figure 104 shows that the optimized working points for the Frixione functional form perform 
slightly better than the standard isolation for a given photon efficiency, and that optimization using no 
functional form in turn performs slightly better than the Frixione functional form; for the same value of 
single-photon efficiency, lower values of fragmentation fraction are attainable. It should be noted that 
this optimisation leads to a looser cut on the first cone, AR < 0.1, than the usual functional form. Nev- 
ertheless, to obtain reductions in the fragmentation fraction of more than 10%, increasingly significant 
reductions in single photon efficiency are required, since the fragmentation reduction becomes nearly 
flat. 

All in all, with the definition of the isolation in a cone of AR used here, which is a usual way of 
defining isolation at the experimental level (where one has however to remove the footprint of the photon 
from the isolation sum and to cope with pile-up), rejecting fragmentation photons can be done only at 
a cost of a lowered signal efficiency. With this optimization procedure it was found that to decrease the 
fragmentation fraction by 10%, a signal loss of about 60% has to be achieved, leading to extremely tight 
cuts probably not applicable in experimental analysis. 



CONCLUSIONS 

Firstly for the NLO cross sections, it was found that only one of the Frixione isolation criteria suggested 
in lITSl actually performs better at removing the fragmentation contribution in the inclusive case than that 
of the standard cone, although potentially too tight to use experimentally. However, it is useful to see 
that the results are independent of the number of cones used. This is also the case for the di-photon cross 
section where it compares well to the continuous criteria. A more promising result in the inclusive case 
is that the generalized version of Frixione isolation, with small values of e, do significantly reduce the 
fragmentation fraction without applying too tight a cut in the smallest cone. In addition it is confirmed 
that the 'saddle point' can not be found when altering the scale choice for the inclusive cross-section, 
suggesting more connections needed at NNLO. Finally when moving to the photon with associated jet 
cross section, from the inclusive cross section, it is reported that the jet algorithm (or size) used at NLO 
makes Uttle difference to the cross section. 

In the study at the parton-shower level it was shown that the isolation profile of fragmentation 
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Fig. 104: The fraction of tlie 1 -fragmentation contribution vs single 7 efficiency for various sets of 
criteria. Blue : tested working points reported on table 15 Black : selection criteria on isolation in 
AR < 0.4. Green : Optimized Frixione isolation using the usual functional form. Red : Re-optimized 
isolation criteria on cones AR < 0.1, 0.2, 0.3, 0.4 without using the functional form. 




photons is no longer collinear anymore, whereas the modelization of the fragmentation function in NLO 
generators leads the quark/gluon debris to be collinear to the photon. Furthermore, the hadronization 
process of the quark/gluon that emitted the photon leads to a AR profile which is no longer peaked at 
zero (but close to zero). In parton-shower programs isolation still has increasing discriminating power 
when going lower in AR. However, it was found that a 10% decrease in fragmentation fraction in di- 
photon events with respect to standard isolation leads to a drop in single photon signal efficiency to 
approximately 60% of the initial value. The usual functional form for Frixione isolation was shown to 
be not completely optimal for suppressing the fragmentation contribution while preserving high signal 
efficiency. This can be mitigated by re-optimizing the cuts for each AR of the discrete Frixione pre- 
scription, which allows a looser cut in the innermost cone. Further studies using other fragmentation 
modelizations in parton-shower programs like SHERR\ |427 1 (LO matrix-element where photons and 
jets in the shower are matched to matrix element level) or POWHEG 1148411 (NLO matrix-element with 
consistent fragmentation photon matching) would need to be investigated. 

In conclusion the results from the two studies show there are differences and similarities at the 
two levels. Regarding the fragmentation fraction, this is far more reduced at the NLO level than at the 
parton-shower level. However, the two levels show agreement that the results from Frixione isolation are 
independant of the number of cones used and that similar shape cuts can be obtained by retuning cuts at 
the parton-shower level and by using the generaUzed prescription at NLO. 
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APPENDIX: Selection details 

All of the studies are carried out for pp collisions at -v/5 = 7 TeV. For simplicity the the inclusive studies 
are carried out for the region where the photon lies in |?7| < 0.6. When calculating the cone isolation 
around the photon a cone of 0.4 is used with the requirement that the energy in the cone is less than 
4GeV. The renormalization scale fi and initial state factorization scale M are set to the photon px, 
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unless stated otherwise, and the CTEQ6.6 PDF II256II is used and the photon fragmentation functions are 
BFG set II 114791 ■ For the generalized Frixione isolation case, the cones used are: R = 0.4, 0.35, 0.3, 0.25, 
0.2, 0.15 and O.I. 

In the NLO di-photon studies, the photons have: p^ < 25 GeV, p^ > 22 GeV, in the rapidity 
range |r?''^| < 2.5 for both photons, and a separation AR^^ > 0.4 is required between the two photons. 
The mass range considered is 40 GeV < m^j < 300 GeV. In this case the scales /x and M are chosen 
equal to min {p^ ,P^ }■ 

In the parton shower di-photon studies, the photons are selected with: M^j > 80 GeV, px >21,20 
GeV, |?7| <2.5 and E'^° < 5 GeV. 

20. EVENT-BY-EVENT PILEUP SUBTRACTION USING JET AREAS ^ 

Abstract 

In these proceedings, we compare the efficiency of several jet-area-based sub- 
traction methods to correct for pile-up contamination at hadronic colliders. We 
study the dependence on various variables like the pt and rapidity of the jets, 
the number of pile-up vertices or the Monte-Carlo generator variations. We 
conclude that estimations of the pile-up density using a median computed over 
grid-cell patches, including a rescaling to correct for the rapidity dependence, 
perform particularly well, though alternative methods are possible. 

20.1 Introduction 

With the LHC running at larger and larger luminosities, hard pp interactions are accompanied by an 
increasing number of pile-up (PU) collisions: from a few PU events per bunch crossing in spring 201 1, 
operation with ~ 20 PU events is now routine. Considering only in-time PU, this would lead to an extra 
transverse momentum of ~ 750 GeV deposited in the event, and a jet of a typical radius R = 0.5 would 
see its transverse momentum shifted by ~ 10 GeV. In order to obtain a good energy resolution for the 
jets it is therefore mandatory to correct for this contamination. 

In these proceedings, we review several methods — both existing methods and new refinements 

— to subtract the contamination due to PU and provide a systematic study of their efficiency. 

It is important to note already now that PU has not only the effect to shift the momentum of the jets: 
it also smears their momentum. Indeed, the number of PU vertices varies from one collision to the next 
(following a Poisson distribution varying with the beam conditions), all PU interactions, i.e. minimum 
bias collisions, do not lead to the same energy deposit, and finally, the energy produced in a minimum 
bias collision is not deposited uniformly across the detector. Altogether, on top of an average shift, 
PU will add two sources of resolution smearing: event-to-event and in-event fluctuations corresponding 
respectively to variations of the PU activity from one event to another and from one point to another in a 
single event. 

Here we shall primarily study in-time PU, that is the effects coming from multiple pp interactions 
that occur in the same bunch crossing as the hard interaction one triggers on. Because of the response 
time inherent to each detector this would come with a second effect, out-of-time PU, corresponding to the 
PU activity in the few bunch crossings preceding the one with the hard interaction. Since these heavily 
depend on the details of each individual detector — and even varies from one sub-detector to another 

— it goes beyond the scope of this theoretical study. However, as we shall discuss in further detail later 
on, the PU subtraction methods proposed here do not make any assumption about a distinction between 
in-time and out-of-time PU and thus should be robust enough in more complex cases. 



""Contributed by: M. Cacciari, G. P. Salam and G. Soyez 
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20.2 Subtraction method(s) 

We are interested in the situation where a hard event is contaminated by a background coming from 
additional pileup interactions. A reconstructed jet in thai full event (hard event + background), which 
we shall call a full jet, differs from the hard jet in the original hard event because of the presence of 
the background. By background subtraction, we mean correcting the full jet in such as to recover the 
momentum of the original hard jet, i.e. subtract the pileup contamination from the jet's momentum. 

20.21 Background effects 

Our starting point is to realise [485 1 that a uniform background affects the momentum of a jet in two 
ways: it shifts its momentum because of the background particles clustered with the jet, and it modifies 
the way the hard particles themselves are clustered because the background particles are not infinitely 
soft. 

This means that the reconstructed momentum has the fornr^ 



PtJuU 



- Pt,hard + pA± ay/A + Apf^ (20.2.1) 



where pt denotes the transverse momentum of the reconstructed jet, Pi,hard the momentum of the original 
hard jet (in the absence of PU), A the jet area, p the background density per unit area within a given event, 
a the fluctuations of that background (per unit area) from place to place within the event, and Ap^^ the 
back-reaction describing the effect of the background particles on the clustering of the hard ones. 

If the background has a positional dependence (e.g. depends on rapidity) then p and a will depend 
on the position of the jet one tries to subtract. 

Eq. ( |20.2.1 1 characterises the fact that the background has the effects of shifting the transverse 



momentum of the jet and to degrade its resolution. The shift comes from the "p" term in (20.2.1 1 and 
from potential back-reaction systematic effects. Using the anti-kt jet algorithm the shift due to back- 
reaction is negligiblerj Resolution smearing effects come from various sources: the fluctuations of the 



background from within an event, i.e. the "a" term in (20.2.11, fluctuations of the background from 
one event to another, that is the fact that p is not the same in every event, and the fluctuations in the 
back-reaction. 

20.22 Central subtraction formula 



From (20.2.1 1, the natural way to subtract the background contamination is to define the subtracted jet 
as 114851 

Pt.sub =Pt- PestA (20.2.2) 

where pcst is the estimated value for the background density per unit area. 

To apply this subtraction we need to compute the jet area and find an estimation pcst for the 
background density per unit area. The jet areas are readily available using FastJet, so we just need 
to focus on Pcst- The main goal of these proceedings is to investigate various methods of obtaining 
Pest which are listed below. In all cases, it is primordial to realise that the determination of pcst is 
performed event-by-event, and even jet-by-jet when the positional dependence of the background is 
taken into account. 

As we shall see later on, the fact that p is estimated for each individual event is crucial: it corrects 
for the fluctuations of the background from one event to another. If instead one uses an averaged value 
for Pest (over many events), one would get an extra resolution smearing due to the fluctuations of p across 



different events. Similarly, the jet area A in (20.2.21 has to be computed for each individual jet. Using 



an average area would lead to an additional source of fluctuations of the form py^{A'^) — (A)"^. 



^"TMs can be defined for the 4-momentum of the jet but we shall only discuss its transverse momentum for simplicity. 
"For the kt or Cambridge/ Aachen algorithms, it is usually negative and can be of the order of a GeV. 
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Using seen vertices Since experimentally it might be possible — within some level of accuracy that 
goes beyond the scope of this discussion — to count the number of pileup vertices using charged track 
reconstruction, one appealing way to estimate the background density in a given event would be to count 
these vertices and subtract a pre-determined number for each of them: 

Pest^\y) = f{y) '^PU.seen, (20.2.3) 

where we have made explicit the fact that the proportionality constant f{y) can carry a rapidity depen- 



dence, /(y) can be studied from minimum bias collisions (see Section 20.32 below) and can take into 



account the fact that only a fraction of the PU vertices will be reconstructed. 

Median subtraction This technique divides the rapidity-azimuthal angle plane in patches and esti- 
mates p for each event using 

^2°^=^^) = median 1^1 (20.2.4) 

JG patches ^_ Ai J 

This is motivated by the observation that many regions in the event are populated just by the background. 
In these regions, pt/A is an estimate of p and the use of the median, rather than the average, which 
ensures reduced bias from the hard jets. 

This method was originally proposed in II485II using jets (from a kt or Cambridge/Aachen cluster- 
ing) as patches. Here, we shall also test a new option where the y — 4> plane is simply subdivided into 
grid cells that we use as patches. 



Using a local range Eq. ( 20.2.4 1 provides a unique, global, estimate of p for the event but does not take 
into account the positional-dependence of the background. One option, assuming one wants to estimate 
p at the location of a jet j, is to limit the computation of the median to the jets in the vicinity of j, that 



m 



(local) / .\ T I PtA 



Pr^(j) = . medial, r^ (20.2.5) 

where 'Tl{j) is a local range around j. A typical example, that we shall study later on, is the case of a 
strip range where only the jets with \y — yj\ < Z\ are included. This option was already proven to be 
powerful in 1.486.1 . 

Using rescaling Another option to correct for the rapidity dependence of the backgrouncP^is to intro- 



duce a pre-computed rapidity-reshaping function f{y) (see Section 20.32 1 and use 



1^ ' I 1 1 \ T I 1 1 \ m fii/H T o n J 1 



p):r' (y) = f{y) median -^^ (20.2.6) 

«e patches lAiJ{yi) J 

where now all patches (jets or grid cells) are included in the computation of the median. 

20.3 Performance tests 

20.31 Testing framework 

The remainder of these proceedings will be devoted to an in-depth comparison of the subtraction methods 



proposed in Section[20.2[ Our testing framework will be very similar to the one used in ['4861: we embed 



a hard event into a pileup background (see again Section 20.2 we reconstruct and subtract the jets in 



"■^This option will only be considered in the case where jets are used as patches. 

^^The same technique should also work for the azimuthal-angle dependence of the underlying-event in heavy-ion collisions. 
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both the hard and full eventq^ for each jet in the hard event, we find the matching jet in the full event 
and compute the shift 

A full, sub hard, sub ^^^ t i \ 

^Pt=Pt -Pt > (20.3.1) 

i.e. the difference between the reconstructed-and-subtracted jet with and without pileup. A positive (resp. 
negative) Apt would mean that the PU contamination has been underestimated (resp. overestimated). 

Though in principle there is some genuine information in the complete Apt distribution — e.g. it 
could be useful to deconvolute the extra smearing brought by the pileup, see e.g. I1486| and [487] — we 
shall focus on two simpler quantities: the average shift {Apt) and the dispersion a^p^. While the first 
one is a direct measure of how well one succeeds at subtracting the pileup contamination on average, the 
second quantifies the remaining effects on the resolution. One thus wishes to have (Apt) close to and 
(^Apt ^s small as possible. Note that these two quantities can be studied as a function of variables like 
the rapidity and transverse momentum of the jets or the number of pileup interactions. In all cases, a flat 
behaviour would indicate a robust subtraction method. 

The robustness of our conclusions can be checked by varying many ingredients: 

• one can study various hard processes with the hope that the PU subtraction is not biased by the 
hard event. In what follows we shall study dijets with pt ranging from 50 GeV to 1 TeV, as well as 
fully hadronic if events as a representative of busier final states. 

• The Monte-Carlo used to generate the hard event and PU can be varied. For the hard event, we have 
used Pythia 6.4.24 II400II with the Perugia 201 1 tune, Pythia 8.150 with tune 4C II348II and Herwig 
6.5.10 114881 with the ATLAS tune and we have switched multiple interactions on (our default) 
or off. For the minimum bias sample used to generate PU, we have used Pythia 8, tune 4C, and 
checked that our conclusions remain unchanged when using Herwig++ 1.489.1 (tune LHC-UE7-2). 

Additional details of the analysis For the sake of completeness, we list here the many other details of 
how the Apt analysis has been conducted: we have considered particles with \y\ < 5 with no pt cut or 
detector effect; jets have been reconstructed with the anti-kt algorithm with R = 0.5 keeping jets with 
\y\ < 4; for area computations, we have used active areas with explicit ghosts with ghosts placecFjup 
to |y| = 5; for jet-based background estimations, we have used the kt algorithm with R = 0.4 though 
other options will be discussed (and the 2 hardest jets in the set have been excluded from the median 
computation to reduce the bias from the hard event); for grid-based estimations the grid extends up to 
I y I = 5 with cells of edge-size 0.56 (other sizes will be investigated); for estimations using a local range, a 



strip range of half-width 1.5 has been used and we refer to the Section 20.32 below for more information 
about the rapidity rescaling. Jet reconstruction, area computation and background estimation have all 
been carried out using FastJet (v3) 1136 11 1435 1. Pile-up is generated as a superposition of a Poisson- 
distributed number of minimum bias events and we will vary the average number of pileup interactions. 
We shall always assume pp collisions with ^/s = 7 TeV. Finally, the matching of a full jet to a hard 
jet is made by requiring that their common constituents contribute for at least 50% of the transverse 
momentum of the hard jet. We shall not discuss matching efficiencies here but they are extremely good: 
for a reconstructed (full) jet of 50 GeV and 20 PU events, the matching efficiency is 99.9% and this 
increases to 99.98% for pt > 50 GeV and 5 PU events and 99.995% for pt > 100 GeV and 20 PU 
events. 

20.32 Minimum bias and rapidity shape 



Before discussing the performances of the subtraction methods described in Section 20.2 there is still 



a building block that has to be discussed, namely the rapidity dependence of the background f{y) that 



54 r 



One may argue whether or not one should subtract the jets in the hard event. We decided to do so to cover the case where 
the hard event contains Underlying Event which, as a relatively uniform background, will also be subtracted together with the 
pileup. 

^^Note that we have used the ghost placement of FastJet 3 which differs slightly from the one in v2.4. 
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Fig. 105: Rapidity dependence of the transverse energy per unit area deposited in minimum bias events 
(obtained from Pythia 8, tune 4C). The normaUsation of the fit is such that /seen is the fraction of seen 
minimum bias events i.e. the fraction of events which have at least 2 charged tracks with \y\ < 2.5 and 
Pt> WO MeV. 



enters in Eqs. (20.2.3 1 and (20.2.61. Letting aside the question of in-time vs. out-of-time PU and non- 
linear effects in the detectors, the shape f{y) can be obtained directly from minimum bias events. 

In our case, we have generated minimum bias events with Pythia 8 (tune 4C) and studied the 
rapidity dependence of the transverse momentum deposited per unit area. The result is shown on Fig. 



105 together with a quartic fit. If f{y) is used to rescale median-based estimates of p, Eq. (20.2.61, 



any global normalisation factor would cancel, but in the case of Eq. (20.2.3 1 i.e. for the "seen vertices" 



method, the normalisation has to match what we mean by a seen PU vertex. In what follows, we shall 
define that as a minimum bias interaction that has at least 2 charged tracks with \y\ < 2.5 and pt > 100 
MeV, which corresponds to 69.7% of the eventq^ In these conditions, we have found that the rapidity 
dependence is well reproduced by 



f{y) = 1.051141 



0.023608 y2 + 0.000026 yl 



(20.3.2) 



20.33 Generic performance and rapidity dependence 

Let us begin our performance benchmarks by the study of the rapidity dependence of PU subtraction. 



First of all. Fig. 106 shows the residual average shift {{Apt)) as a function of the rapidity of the hard 
jet. These results are presented for different hard processes, generated with Pythia 8 and assuming an 
average of 10 PU events per hard interaction. Robustness w.r.t. that choice will be discussed in the next 
Section but does not play any significant role for the moment. 

The first observation is that the subtraction based on the number of seen PU vertices does a very 
good job in all 3 cases. Then, global median-based (using jets or grid cells) estimations of p, i.e. the (red) 
square symbols, do a fair job on average but, as expected, fail to correct for the rapidity dependence of the 
PU contamination. If one now restricts the median to a rapidity strip around the jet, the (blue) triangles, 
or if one uses rapidity rescaling, the (black) circles, the residual shift is very close to 0, typically a few 
hundreds of MeV, and flat in rapidity. 

Note that the strip-range approach seems to have a small residual rapidity dependence and overall 
offset for high-pt processes or multi-jet situations. That last point, more clearly observed with some 



*This is a bit optimistic but does not affect in any way our discussion. 
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Fig. 106: Residual average shift as a function of the jet rapidity for all the considered subtraction meth- 
ods. For the left (resp. centre, right) plot, the hard event sample consists of dijets with pt > 50 GeV 
(resp. dijets with pt > 400 GeV, and jets above pt > 50 GeV in ft events), generated with Pythia 8 (tune 
4C) in all cases. The typical PU contamination (for unsubtracted jets) is around 5 GeV. 



Monte-Carlo generators like Pythia 6 than with others, may be due to the fact that smaller ranges tend 
to be more affected by the presence of the hard jets (see e.g. Appendix A.2 of II486II ). an effect which is 
reinforced for multi-jet events. The fact that the residual shift seems a bit smaller for grid-based estimates 
will be discussed more extensively in the next Section. 

Next, we turn to the dispersion of Apt, a direct measure of the impact of PU fluctuations on the 



Pt resolution of the jets. Our results are plotted in Fig. 107 as a function of the rapidity of the hard jet 
(left panel), the number of PU vertices (central panel) and the transverse momentum of the hard jet (right 
panel). All subtraction methods have been included as well as the dispersion one would observe if no 
subtraction were performed. 

The results show a clear trend: first, a subtraction based on the number of seen PU vertices bring 
an improvement compared to not doing any subtraction; second, median-based estimations of p give a 
more significant improvement; and third, all median-based approaches perform similarly well. 

The reason why median-based estimations of p outperform the estimation based on the number 
of seen PU vertices is simply because minimum bias events do not all yield the same energy deposit 
and this leads to an additional source of fluctuations in the "seen vertices" estimation compared to all 
median-based ones. This is the main motivation for using an event-by-event determination of p based 
on the energy deposited in the event. This motivation is further strengthened by the fact that additional 
issues like vertex resolution or out-of-time PU would affect both (Apt) and cr/^p^ if estimated simply 
from the number of seen vertices while median-based approaches are more robust. 

Note finally that even though local ranges and rapidity rescaling do correct for the rapidity depen- 
dence of the PU on average, the dispersion still depends on rapidity. The increase with the number of 
PU vertices is in agreement with the expected ^/npy behaviour and the increase with the pt of the hard 
process can be associated with back-reaction, see |486|. These numbers can also be compared to the 
typical detector resolutions which would be ~10 GeV for 100 GeV jets and ~20 GeV at pt = 400 GeV 
114901 I49T1I . 



20.34 Robustness and Monte-Carlo dependence 

The last series of results we want to present addresses the stabihty and robustness of the median-based 
estimation of the PU density per unit area. 

To do that, the first thing we shall discuss is the Monte-Carlo dependence of our results. In Fig. 
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Fig. 107: Dispersion a Apt- Each curve corresponds to a different subtraction method and the results are 
presented as a function of different kinematic variables: left, as a function of the rapidity of the hard jet 
for a sample of jets with pt > 100 GeV and assuming an average of 10 PU events; centre: as a function 
of the number of PU events for a sample of jets with pt > 100; right: as a function of the pt of the hard 
jet, assuming an average of 10 PU events 
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Fig. 108: Dependence of the average pt shift as a function of the number of PU vertices for various 
Monte-Carlo generators. For the left plot, the hard sample is made of dijets with pt > 100 GeV while 
for the right plot, we have used a hadronic ti sample. For each generator, we have considered both the 
case with the Underlying Event switched on (filled symbols) and off (open symbols). All results have 
been obtained using a grid-based median estimation of p using rapidity rescaling. 
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Fig. 109: Average residual shift after PU subtraction. {Apt) is plotted as a function of the pt of the jet 
for an average of 10 PU events (left panel), or as a function of the number of PU vertices for dijets with 
Pt > 100 GeV (central panel) and for tt events (right panel). In all cases, we compare 3 methods: the 
rapidity-strip range, (red) triangles, the jet-based approach with y-rescaling, (blue) circles, and the grid- 
based approach with y rescaling, (black) squares. Each curve is the result of averaging over the various 
Monte-Carlo generator options and the dispersion between them is represented both as error bars on the 
top row and directly on the bottom row. 



108 we compare the different Monte-Carlo predictions for the (Apt) dependence on the number of PU 



vertices in the case of a grid-based median estimate of p with rapidity rescaling. For each of the three 
considered Monte-Carlos, we have repeated the analysis with and without Underlying Event (UE) in the 
hard event. The first observation is that all the results span a range of 300-400 MeV in Apt and have 
a similar dependence on the number of PU vertices. The dependence on npu is flat for dijet events but 
shows a small decrease for the busier tt events. The 300-400 MeV shift splits into a 100-200 MeV effect 
when changing the generator, which is likely due to the small but non-zero effect of the hard event on 
the median computation, and a 100-200 MeV effect coming from the switching on/off of the UE. 

This question of subtracting the UE deserves a discussion: since the UE is also a soft background 
which is relatively uniform, it contributes to the median estimate and, therefore, one expects the UE, or 
at least a part of it, to be subtracted together with the PU. Precisely for that reason, when we compute 
Apt, our subtraction procedure is not applied only on the "full jet" (hard jet-i-PU) but also on the hard jet, 
see Eq. ( |20.3.1 1. The 100-200 MeV negative shift observed in Fig. 108 thus means that, when switching 
on the UE, one subtracts a bit more of the UE in the full event (with PU) than in the hard event alone 
(without PU). This could be due to the fact (see [492 1 for details) that for sparse events, as is typically 
the case with UE but no PU, the median tends to slightly underestimate the "real" p, e.g. if half of the 
event is empty, the median estimate would be 0. This is in agreement with the fact that for tt events, 
where the hard event is busier, switching on the UE tends to have a smaller effect. Note finally that as 
far as the size of the effect is concerned, this 100-200 GeV shift has to be compared with the ~1 GeV 
contamination of the UE in the hard jets. 

Finally, we wish to compare the robustness of our various subtraction methods for various pro- 
cesses i.e. hard events and PU conditions. In order to avoid multiplying the number of plots, we shall 
treat the Monte-Carlo (including the switching on/off of the UE) as an error estimate. That is, an average 
measure and an uncertainty will be extracted by taking the average and dispersion of the 6 Monte-Carlo 



setups. The results of this combination are presented on Fig. 109 for various situations and subtraction 
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Fig. 1 10: Left: relative difference between the reconstructed jet and the reconstructed Z boson transverse 
momenta. Right: at a given -pt of the reconstructed Z boson, difference between the reconstructed 'pt of 
the jet and the ideal -pt with no UE or PU, i.e. pt shift w.r.t. the "noUE" curve, the (black) triangles, on 
the left panel. See the text for the details of the analysis. 



methods. For example, the 6 curves from the left plot of Fig. 108 have been combined into the (black) 



squares of the central panel in Fig. 109 



Two pieces of information can be extracted from these results. First of all, for dijets, the quality of 
PU subtraction is, to a large extent, flat as a function of the pt of the jets and the number of PU vertices. 
When moving to multi-jet situations, we observe an additional residual shift in the 100-300 MeV range, 
extending to ~500 MeV for the rapidity-strip-range method. This slightly increased sensitivity of the 
rapidity-strip-range method also depends on the Monte-Carlo. While in all other cases, our estimates 
vary by ~ 100 MeV when changing the details of the generator, for multi-jet events and the rapidity- 
strip-range approach this is increased to ~200 MeV. 

Overall, the quality of the subtraction is globally very good. Methods involving rapidity rescaling 
tends to perform a bit better than the estimate using a rapidity strip range, mainly a consequence of the 
latter's greater sensitivity to multi-jet events. In comparing grid-based to jet-based estimations of p, one 
sees that the former gives slightly better results, though the differences remain small. 

Since the grid-based approach is considerably faster than the jet-based one, as it does not require an 
additional clustering of the evenf^ the estimation of p using a grid-based median with rapidity rescaling 
comes out as a very good default for PU subtraction. One should however keep in mind local-range 
approaches for the case where the rapidity rescaling function cannot easily be obtained. 

20.4 PU V. UE subtraction: an analysis on Z+jet events 

To give further insight on the question of what fraction of the Underlying Event gets subtracted together 
with the pileup, we have performed an additional study of Z+jet events. We look at events where the Z 
boson decays into a pair of muons. We have considered 5 different situations: events without PU or UE, 
events with UE but no PU subtracted or not, and events with both UE and PU again subtracted or not. 
Except for the study of events without UE, this analysis could also be carried out directly on data. 

Practically, we impose that both muons have a transverse momentum of at least 20 GeV and have 
\y\ < 2.5, and we require that their reconstructed invariant mass is within 10 GeV of the nominal Z 



Note that the clustering of the main event still needs to include the computation of jet areas since they are needed in 
Eq. (|20.2.2b. 
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mass. As previously, jets are reconstructed using the anti-fct jet algorithm and the pileup subtraction is 
performed using the grid-based-median approach with rapidity rescaling and a grid size of 0.55. All 
events have been generated with Pythia 8 (tune 4C) and we have assumed an average PU multiplicity of 
20 events. 



In Fig. 1 10 we have plotted the ratio Ptjct/pt,z — 1. with ptjet the transverse momentum of the 
leading jet, for the various situations under considerations. Compared to the ideal situation with no PU 
and no UE, the (black) triangles, one clearly sees the expected effect of switching on the UE, the empty 
(green) circles, or adding PU, the empty (red) squares: the UE and PU add to the jet ~1.2 and 13 GeV 
respectively. 

We now turn to the cases where the soft background is subtracted, i.e. the filled (blue) squares and 
(magenta) circles, for the cases with and without PU respectively. There are two main observations: 

• with or without PU, the UE is never fully subtracted: from the original 1-1.5 GeV shift, we do 
subtract about 800 MeV to be left with a 0-500 MeV effect from the UE. That effect becomes 
smaller and smaller when going to large pt. 

• in the presence of PU, the subtraction produces results very close to the corresponding results 
without PU and where only the UE is subtracted. This nearly perfect agreement at large ptjet 
slightly degrades into an additional offset of a few hundreds of MeV when going to smaller scales. 
This comes about for the following reason: the non-zero pt resolution induced by pileup (even 
after subtraction) means that in events in which the two hardest jets have similar pt, the one that is 
hardest in the event with pileup may not correspond to the one that is hardest in the event without 
pileup. This introduces a positive bias on the hardest jet pt (a similar bias would be present in 



real data even without pileup, simply due to detector resolution). The "matched" curve in Fig. 1 10 
(right) shows that if, in a given hard event supplemented with pileup, we explicitly use the jet that 
is closest to the hardest jet in that same event without pileup, then the offset disappears, confirming 
its origin as due to resolution-related jet mismatching. 

20.5 Conclusions and discussion 

In these proceedings, we have investigated several methods to correct for the pile-up contamination 
to jets. They are all based on the observation that the average PU contribution to a jet is on average 



proportional to its area, which directly leads to eq. ( 20.2.2 1. The various methods then differ by the 
method used to estimate the PU activity per unit area, p. The subtraction efficiency has been studied by 
embedding hard events into PU backgrounds and investigating how jet reconstruction was affected by 
measuring the remaining pt shift after subtraction {{Apt)) as well as the impact on resolution {aApt)- 

There are 3 broad approaches to the estimation of p: (a) using an average contamination per PU 
vertex, the seen vertices approach, (b) using an event-by-event estimation and, the median approach with 
jets or grid cells as patches, and (c) using an event-by-event and jet-by-jet method, the local range or 
rescaling approaches. 

The first important message is that, though all methods give a very good overall subtraction 
{{Apt) ~ 0), event-by-event methods should be preferred because their smaller PU impact on the pt 
resolution (see Fig. |107| ). This is mostly because the "seen vertices" method has an additional smear- 
ing coming from the fluctuations between different minimum bias collisions. This does not happen in 
event-by event methods that are only affected by point-to-point fluctuations in an event. Note also that 
event-by-event methods are very likely more robust than methods based on identifying secondary vertices 
when effects like vertex identification and out-of-time PU are taken into account. 

The next observation is that event-by-event and jet-by-jet methods have the additional advantage 



that they correct for positional-dependence of the background like its rapidity dependence (see Fig. 106 1. 
The median approach using a local range (with jets as patches) or rapidity rescaling (using jets or grid 
cells as patches) all give an average offset in the 0-300 MeV range, independently of the rapidity of 
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the jet, its pt or the number of PU vertices, see Fig. 109 and are thus very suitable methods for PU 
subtraction at the LHC. Pushing the analysis a bit further one may argue that the local-range method has 
a slightly larger offset when applied to situations with large jet multiplicity like tt events (the right panel 



of Fig. 109 1 though this argument seems to depend on the Monte-Carlo used to generate the hard-event 
sample. Also, since it avoids clustering the event a second time, the grid-based method has the advantage 
of being faster than the jet-based approach. 

At the end of the day, we can recommend the median-based subtraction method with rapidity 
rescaling and using grid cells as patches as a powerful default PU subtraction method at the LHC. But 
one should keep in mind that the use of jets instead of grid cells also does a very good job and that 
local-ranges can be a good alternative to rapidity rescaling if the rescaling function cannot be computed. 
Also, though we have not discussed that in detail, a grid cell size of 0.55 is a good default as is the use 
of kt jets with i? = 0.4. 

To conclude, let us make a few general remarks. First, our suggested method involves relatively 
few assumptions, which helps ensure its robustness. Effects like in-time v. out-of-time PU or detector 
response should not have a big impact. Many of the studies performed here can be repeated with "real 



data" rather than Monte-Carlo simulations. The best example is certainly the Z-f-jet study of Section 20.4 
which could be done using data samples with different PU activity from 2010 and 201 1. Also, the rapidity 
rescaling function can likely be obtained from minimum bias collision data and the embedding of a hard 
event into pure PU events could help quantifying the remaining 0(100 MeV) bias. Experimentally, it 
would also be interesting to investigate hybrid techniques where one would discard the charged tracks 
that do not point to the primary vertex and apply the subtraction technique described here to the rest 
of the event. This would have the advantage to further reduce fluctuation effects (roughly by a factor 
~ a/1/(1 — /chg) ~ 1-6, where f^hg ~ 0.61 is the fraction of charged particles in an event). Finally, all 
the facilities to compute jet areas and background estimation — including jets or grid-cells as patches, 
local ranges and rescaling functions — are readily available from FastJet (v3.0.0 onward) using e.g. the 
GridMedianBackgroundEstimator or Subtracter tools. 
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Part VI 

MC TUNING AND OUTPUT FORMATS 

21. TUNE KILLING: QUANTITATIVE COMPARISONS OF MC GENERATORS AND 

TUNES 

Abstract 

We summarise the implementation, status, and scope of the "tune killing" 
project, which classifies MC generator codes and tunes according to their qual- 
ity of data description across a range of LHC-relevant observables. The pri- 
mary aim of the project is to provide sufficiently clear information about gen- 
erator performance that the current large collection of available tunes may be 
objectively reduced to a more manageable standard set for common use by 
LHC experiments and phenomenologists. We make final recommendations as 
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to which generators and tunes are in rude health, and those which are obvious 
candidates for retirement from active service. 

21.1 INTRODUCTION 

Popular MC generators are nowadays associated with a bewildering array of standard parameter config- 
urations, called "tunes". This proliferation of tunes is due to the ongoing project to provide optimised 
descriptions of LEP, Tevatron and LHC data: as new data and techniques have become available, new 
tunes have been created, usually but not always with increasing quality of data description. This pro- 
cess looks set to continue, and hence there is a need for agreement on which tunes are of most common 
interest at a given time. 

The PYTHIA6 II400II event generator in particular has been the de facto testbed for tuning due to the 
wealth of community expertise and its ubiquity of tuning parameters for physical processes. At the 
time of writing there are 77 tunes available via the built-in PYTUNE routine, and a further 10 or more 
presented by the ATLAS experiment alone (this counting of ATLAS tunes includes equivalently weighted 
tunes for multiple PDFs, but not systematic variation tunes, of which there are many more). With such 
a profligacy of configuration options, it is difficult to objectively decide which are to be preferred for 
LHC simulation without manually cross-referencing hundreds of plots. It is hence not uncommon for 
different experimental or phenomenological studies to use entirely disjoint MC generator setups, making 
comparison difficult. Ideally we would have a much smaller set of agreed-upon generator setups, but 
choosing such a privileged subset requires clear information on which to base our preferences. 

As a first step to addressing this issue, we present here a comparative study of event generator codes 
and tunes across a range of observables, particularly those of relevance for LHC physics. The study 
is based on analyses from the Rivet II360II toolkit, and the resulting data descriptions are quantitatively 
scored based on measures of deviation from the data values, including y^ and median/maximum bin- 
wise deviations (in units of combined experimental, statistical, and theoretical uncertainties). The results 
are presented as a series of Web pages, using colour coded tables which are hyperlinked to provide the 
necessary information in a compact, hierarchical form. 

21.2 Analysis system 

The data analysed for this project was produced by individual runs of various generator/tune configura- 
tions into the Rivet analysis system. A choice of Rivet analyses was made, intended to cover a number 



of core QCD modelling aspects for LHC physics: these are documented in Table 16 In some cases only 
the most relevant range in the distribution is included, as indicated in the Table. The generators and tunes 
used are documented in Table [T/l 

Note that not all observables are suitable for all generators. For example, AlpGen has not been used 
for LEP fragmentation, although in a future iteration we will extend the AlpGen coverage to include 
underlying event observables, where the hard jets could interfere with those from the multiple parton 
interaction (MPI) mechanism. Several observables, notably hard photon physics, minimum bias observ- 
ables, and fragmentation/strangeness from RHIC and LHC have not yet been included: this is envisaged 
as a future extension of the project. 

A Python program was written to load the histogram files for each generator/tune combination from a 
hierarchical directory structure, and to perform some basic statistical characterisation on each bin, his- 
togram, and semantic group of histograms. At the histogram level, specifications are used to determine 
which bins are to be considered in the statistical comparisons, and to add a nominal "theoretical uncer- 
tainty". In this study a 10% theoretical uncertainty was added to the underlying event and fragmentation 
observables and a 5% theoretical uncertainty on the rest. The combined uncertainty for each bin b is 
then computed from the sum in quadrature of the reference data error, the MC statistical error and the 
theoretical uncertainty, and is used to compute a MC-data deviation for that bin, expressed in units of 
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Observable 



Rivet analysis 


Ref. 


Range 


ATLAS _2010_S8894728 


14931 


p'l^'' > 5 GeV 


ATLAS _2010_S8894728 


1493^ 


p'X^''>5GeV 


ATLAS _2010_S8894728 


|493,1 




ATLAS^010_S8894728 


14931 


p^l'"^>5GeY 


ATLAS _2010_S8894728 


14931 


p'|^'^>5GeV 


ATLAS _2011_S 8924791 


L4941 




ATLAS _2011_S 8924791 


14941 




ATLAS _2011_S8971293 


n495l 


Sir /A — ;> vr 


ATLAS ^01 1_S 897 1293 


14951 


37r/4 — ;> vr 


ATLAS _2010_S88 17804 


f382,l 




CMS_2011_S8957746 


14961 




D0_2010_S8821313 


14971 


(/.* < 0.4 


D0^010_S8821313 


14971 


(/.* < 0.4 


DELPHL1996_S3430090 


f498l 




PDG_HADR0NJV1ULTIPLICITIES 


1499] 




DELPHL1996_S3430090 


[498] 




DELPH0002_069_CONF_603 


[500] 





Underlying event 

Transverse region A'ch vs. p^^'^,p^^ > 500 Me V 
Transverse region ^p_L vs. p^^'^,p^^ > 500 MeV 
Transverse region {p±) vs Nch,p^}^ > 500 MeV 

Jets 

Toward region A^ch vs p^l^'^,pf > 500 MeV 
Toward region ^ p± vs p^^'^, p^ > 500 MeV 
Jet shapes, 30 < p_L < 40 GeV, \y\ < 2.8 
Jet shapes, 310 < p± < 400 GeV, \y\ < 2.8 
Dijet A(t>, no <p± < 160 GeV 
Dijet A(t>, 310 < p± < 400 GeV 
Dijet mass, 0.3 < |y| < 0.8, anti-A;_L(0.4) 
Transverse thrust, 90 GeV < p'^^ < 125 GeV 

ISR/intrinsic- k± 

D0(f>*, \y\ < 1.0 
D0(t)*,l.O< \y\ < 2.0 

Fragmentation 

iVch, vr+/7r-, K+/K- at LEP 
p/tt, K/tt, S^'^'^'^ /tt, p/tt, A/tt 
Inclusive Xp, thrust (+ major & minor) 
B fragmentation 



Table 16: Observables used in the tune killing exercise. 



Generator and version 



Tunes 



Default (CTEQ6.6) 

LHC-UE-EE-3 series (LO** and CTEQ6L1) 

4C 

D6T, DWlSnil, Z2, AMBTliOl, AUET2B (LO** and CTEQ6Ll)||50l, 

Perugia 2010(501, Perugia 201 l l504l . prof-Q^BSTl 

Same tunes as PYTHIA6. 

Perugia 2011 using matched ME/PS ylnm . 150611 
HERWIG 6.510 II488I + JIMMY 4.31 ll507ll AUET2 LO** l508l 
AlpGen + HERWIG 6.5 + JIMMY 4.3 1 (*) Same as for HERWIG+ JIMMY. 



Sherpa 1.3.1111461 
Herwig++ 2.5.2g89l 
Pythia8.150[348j 
PYTHIA 6.425 14001 

AlpGen[5051 + PYTHIA 6.425 (*) 



Table 17: Generators and tunes used in the tune killing exercise. (*) Jet and Z boson cf)* observables 
only. 
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the total bin error, dev^ = (MC^ — datafc)/err6. 

For each active histogram, the system then reports the x^/-^bin. and the median, mean, and maximum 
bin-wise deviation. A total "metric" value for each histogram is reported as the maximum bin-wise 
deviation if that is greater than lOa, otherwise the greater of the median and mean deviations. This hybrid 
treatment of the metric allows the system to flag up histograms in which there are either widespread 
moderate deviations or a small number of very discrepant bins which might be missed with a pure 
median or mean deviation treatment. An HTML table and set of histograms are rendered by the system 
for each observable, with a continuous colour coding scheme used to highlight the relative quality of data 
description from ideal (green) to very poor (red). 

The histograms are grouped to collect together observables from different sources which reflect related 
aspects of QCD modelling. The current groups are "Underlying Event (UE)", "Dijets", "Multijets", 
"Jet shapes", "W and Z", "Fragmentation", and "-B fragmentation". In these groups, the same x^/A'^bin. 
and mean/median/maximum deviation statistics are calculated as before. For visual compactness of 
classification we again use a hybrid performance metric for each histogram group: again this is the 
maximum bin-wise deviation found in the contained histograms if that is greater than lOcj, otherwise the 
maximum histogram- wise deviation metric in the group if that is greater than 5o", otherwise the maximum 
of the median/mean bin-wise deviation. 

The Web pages generated to present this data in a compact way consist of a single top level page con- 
taining a colour-coded table of tune performance metrics for each histogram group. Each cell in the table 
is hyperlinked to a more detailed table for that tune/group where the various x^ /N, max/mean/median 
deviation and hybrid metric are presented, again colour-coded, for each histogram in the group. The 
table rows are then hyperlinked to a plot page showing explicitly the tune/generator behaviour for each 
histogram and indicating the active range of the histograms where appropriate. These pages are shown in 



Figures 1 1 1 to 1 13 This form of presentation allows a rapid assessment of generator/tune performance, 
while still permitting detailed investigation of any flagged-up issues with a few mouse clicks. The system 
is easily extensible to more observables, groups, and different theory uncertainty / visual classification 
thresholds. 

The classification colours for each performance figure are generated in HSB colour space as a linear 
variation in deviation x between green (120) and red (0) in the Hue parameter, i.e. H = 120(1 — 
min(x/xbad5 1-0)), with fixed Saturation and Brightness parameters. The visual threshold Xbad was cho- 
sen to be different for each metric type: ba for maximum deviations, 4o" for 'x^ /N, and 2cr for mean and 
median deviations, and for the hybrid performance metrics. These thresholds were iterated from initial 
suggestions to the point where distinctions could be made between the models: similar iteration of the 
discriminating criteria are envisaged while significant model/tune variations exist as the motivation of 
this study is model discrimination rather than passing or failing a natural performance figure. 

21.3 Results 

As the central theme of this project has been to provide a comprehensible visualisation of the relative 
performance of generators and tunes, and hierarchical presentation via Web pages was key to achieving 
this, it would be self-defeating to attempt to present the same information in this summary. Additionally, 
the nature of tune comparison is that it evolves as new data, tunes, and generator versions become avail- 
able. Hence, for up-to-date status information we refer the reader to the persistent "tune killing" web 
page at http: //projects, hep forge, org/ rivet /tunecmp/i 

However, it is worth mentioning some of the most striking features of generators which have been made 
more evident by this collating of data-MC comparisons: 

• The general quality of jet and W/Z data description is in fact better than expected: among PYTHIA 
tunes in particular there is sufficient variation in parton shower parameters that significant devia- 
tions in jet observables would reasonably be expected, but in fact the majority of tunes describe 
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Tune comparisons 

Deviation metrics per gen/tune and observable group: 



Gen 


Tune 


UE 


Dijets 


Multijets 


Jet shapes 


W and Z 


Fragmentatian 


Bfrag 


AlpGen 

HERWIG 

Herwig++ 

PYTHIA6 

Pythias 
Sherpa 


HERWIG6 

PYTHIA6-AMBT1 

PYTHIA6-D6T 

PYTHIA6-P2010 

PYTHIA6 P2011 

PTTHIA6 Z2 

PYTHIA6 praf02 

AUET2 CTE06L1 

AUET2 LOxx 

2.5.1 UE EE 3 CTE06L1 

2.5.1-UE-EE-3-WRSTLOXX 

AWBTl 

AUET2B-CTE06L1 

AUET2B-L0XX 

D6T 

□W 

P2010 
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Fig. Ill: Screenshot of the top-level summary page produced by the tune comparison system. 
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Median deviation 


Mean deviation 


Max deviation 


Metric 


Jet shape s\rhos for Sp_\p... (ATLAS_2011_S8924791/d01-x06-y01) 
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Fig. 112: Screenshot of the mid-level performance metric page produced by the tune comparison system. 
This specific example is part of the performance metrics for the Herwig++ LHC-UE-EE-3 LO** tune. 
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Plots for PYTHIA6, AUET2B-LOxx, B frag 
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Fig. 113: Screenshot of the observable plot page produced by the tune comparison system. This specific 
example shows the B fragmentation performance of the PYTHIA6 AUET2B LO** tune. 

data fairly well. 

• The UE in particular has been a focus of tuning activity and this is evident in the consistency of 
UE data description. The worst performance in this group is from the DW tune of PYTHIA, but 
even this pre-LHC tune with the "old" PYTHIA MPI model achieves a deviation metric of less 
than la on LHC UE observables. 

• PYTHIA D6T outperforms PYTHIA DW - an unexpected result since the MPI energy evolution 
of D6T is fixed to the default and disfavoured form p\{s) ~ {^/s/l800Gey)^^^^, whereas the 
exponent in DW is closer to the tuned consensus of ~ 0.25. This may be a lucky behaviour at 
7 TeV, and hence care is needed with extrapolation of D6T to 8, 10, or 14TeV, but it is clear that 
the PYTHIA Q^-ordered parton shower is not yet dead on purely physics grounds. The best tune 
of this PYTHIA configuration, however, is Prof-Q^, which in addition to general small improve- 
ments, is significantly better than DW or D6T at describing the vector boson p± distribution. 

• PythiaS is generally seen to perform very well, and provides significant improvements over 
PYTHIA6 for jet shapes and B fragmentation. Tuning focus is accordingly beginning to shift 
towards PythiaS, also for minimum bias observables not yet considered here. 

• AlpGen interacts strongly with tunes on jet shape and vector boson data descriptions. In partic- 
ular there appears to be little motivation to use AlpGen with the D6T or Perugia 2010 tunes of 
PYTHIA6. AlpGen-i-HERWIG also has significant problems with jet shapes in particular, and the 
indication of this study is that AlpGen-i-PYTHIA Perugia 2011 is the most performant configura- 
tion, closely followed by AlpGen-i-PYTHIA Z2. Notably, the Perugia 2011 tune of PYTHIA was 
specifically developed to minimise ME/PS merging artefacts when used with AlpGen. 

• Both HERWIG and Herwig-i~i- have problems describing LEP fragmentation data, but Herwig-i~i- 
is a very significant improvement over its Fortran cousin. The identified hadron rates are in partic- 
ular much improved, although K^ and U^ remain anomalous. However, a known problem with 
Herwig-i~i- is the poor description of the LEP thrust distribution, which overshoots significantly in 
the multi-parton region. 

• AlpGen seems to have difficulty describing dijet azimuthal decorrelations, even when restricted to 
the 2/3 parton region of the plot. This is particularly surprising as AlpGen is intended to provide 
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the multi-parton configurations needed to describe this observable. 

• B fragmentation is in general quite poorly described. The best descriptions are by PythiaS, the 
AUET2B tunes of PYTHIA6, and Herwig++. Other generators and tunes are in decidedly dodgy 
shape for i3-specific predictions at the LHC. 

Insofar as it is within the scope of this project to make recommendations for canonical generator and 
tune choices, we note that the Perugia 2011, AUET2B, and Z2 tunes of PYTHIA6 provide the best 
data descriptions currently available with that generator and that the Prof-Q^ tune is the best available 
configuration using the Q^ -ordered PYTHIA parton shower. We hence recommend these 4 PYTHIA 
tunes as the current minimal set of PYTHIA tunes for general use at the LHC, particularly once an 
update of the ATLAS AUET2B tune has fixed the tuning issue with the Z p±. 

Among the other generators, where there is not such a proliferation of tunes, we note again the appar- 
ent performance issues with AlpGen - this is clearly in need of further pursuit. However, to reduce the 
amount of comparison needed, we note that Perugia 20 11 is the only PYTHIA6 tune now optimised for 
use with AlpGen with avoidance of the worst effects of ME/PS coupling mismatches: hence future stud- 
ies can quite happily restrict themselves to this AlpGen-i-PYTHIA configuration. As AlpGen-i-HERWIG 
has several problems with jet description, HERWIG itself has serious problems with both light and B 
fragmentation, and no further tuning of the JIMMY MPI model is envisaged, the HERWIG generator 
cannot be recommended for future use in any capacity where an alternative exists. 

The "new" C-i~i- generators Herwig-i~i-, PythiaS, and Sherpa all perform well, with the exception of 
Sherpa's B fragmentation and the Herwig-i~i- light fragmentation. PythiaS generally behaves well but 
some tuning or development may be needed to improve inter-jet observables and the Z p± spectrum. In 
general, the C-i-i- generators are in good health, and we anticipate further improvements as the focus of 
tuning studies shifts to them. 

21.4 Outlook 

This project has put in place a system and a set of classification criteria which have proven useful for 
summarising and investigating MC generator model and tune predictivity for a variety of QCD phenom- 
ena. While we claim no mandate to truly "kill" certain tunes or generators, and wish to emphasise that 
a poor performance in a single observable type (in particular B fragmentation) certainly does not render 
that generator useless, the results from these comparisons do provide strong arguments for deprecation 
of at least several PYTHIA6 tunes and of the Fortran HERWIG generator in general. 

It is the nature of a project like this that results are continually being updated, and there are many natural 
avenues for extension which we wish to pursue, in particular: 

• Extra observables, e.g. minimum bias and E± flow, LHC and Tevatron photon physics, LHC W/Z 
p± data, strangeness data from LHC and RHIC, explicit multijet observables, etc.. 

• Extra generators and tunes, in particular POWHEG-i-PYTHIA/HERWIG/PythiaS/Herwig-i-i-, Mad- 
Graph-i-PYTHIA/PythiaS, MC@NLO-i-IIERWIG/Herwig-i-i-. Comparison between Sherpa with 
the CTEQ6.6 and CTEQ6L1 PDFs. New PythiaS and PYTHIA6 tunes from ATLAS. 

Greater automation of the data generation will be important, as finding resources (human rather than 
CPU!) to produce and run combinatoric numbers of generator/tune/PDF/observable combinations has 
been troublesome. We suggest that this project can make use of the output of the CERN LPCC MCplots 
system (also Rivet-based) for future extension. We also look forward to a forthcoming major upgrade 
of the Rivet histogramming system which will greatly simplify the treatment of multi-leg generators for 
which the n-parton samples must be explicitly merged, e.g. AlpGen, MadGraph, etc. 

22. COMPACT ASCII OUTPUT FORMAT FOR HEPMC 
Abstract 

^^'Contributed by: A. Buckley, L. Garren, G. Hesketh, H. Hoeth, L. Lonnblad, E. Nurse, S. Platzer, G. Salam, G. Soyez 
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We discuss the possibility of reducing the footprint of HepMC event files. Dif- 
ferent compression options are discussed, and a suggestion for an update of 
the HepMC ASCII file format is presented. 

22.1 Introduction 

The HepMC II509II event record has become the de-facto standard for communicating events between 
event generators and different kinds of analysis programs. HepMC also provides an ASCII-based file 
format for storing and retrieving events to and from disk, which has also become the standard. This 
file format is not at all optimized for size, and although disk space today is fairly cheap, there are still 
problems associated with handling very large files. 

A typical minimum-bias 7 TeV LHC HepMC event occupies around 50 kB when written on disk. More 
interesting events are usually bigger than this and one would typically want to store many events to get 
anywhere near the statistics collected by any of the LHC experiments; it is clear that such event files will 
become very large and difficult to handle. Even with standard compression algorithms such as gzip and 
bzip2, where these file sizes can be reduced by a factor 3 or more, the problem is still substantial. 

One could imagine using a binary output format to reduce the event size. Writing a 4 byte floating point 
number in an ASCII file typically takes 10-12 characters, so here one could expect to reduce file sizes up 
to a factor 3. However, standard compression algorithms are rather good at identifying strings of numbers 
and compressing them, so there is normally not much to be gained by using a compressed binary format 
compared to a compressed ASCII one. In addition one would lose the advantage of ASCII files that they 
are (somewhat) readable to the human eye. 

Instead the key to reducing file sizes is to remove redundant and unnecessary information stored in the 
files. This could involve completely reversible operations such as removing the information about the 
momentum of an intermediate particle, as this can be reconstructed from its decay products. It could 
also involve irreversible operations such as reducing the precision on the momenta. In the following we 
describe a number of such operations, which allows us to reduce the file sizes by almost a factor 30. 

22.2 The Benchmarking procedure 

We started out by generating 1000 non-diffractive QCD events with Pythia 6.425 114001 using the AG- 
ILe II360II interface. The resulting file size was 48 MB, which can be reduced to 16 MB or 13 MB using 
gziporbzip2 respectively. We then investigated several ways of reducing this size. 

Removing irrelevant particles The HepMC format contains quite a lot of information about how the 
event was generated, such as intermediate particles in the hard sub process, which may be generator- 
dependent (and often unphysical) and is not relevant when comparing to experimental data. In principle 
one could argue that the only thing that should be written out is final-state stable particles (with HepMC 
status code 1). However, there are circumstances where information about intermediate unstable hadrons 
(status code 2) is relevant. The AGILe event generator interface already includes facilities for keeping 
only particle entries with status code 1 or 2. 

Reconstructible information Some information in the HepMC file is redundant in the sense that it 
can be reconstructed from other information in the file. Here are some examples. 

• Both energy, momentum and invariant mass of each particle is written out. Clearly, we can eg. 
reconstruct the energy given the three-momentum and mass^ 



^''One could also reconstruct the mass given energy and momentum, but this typically gives large precision problems for 
small masses. 
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Format 


Status codes 


no comp. 


gzip 


bzip2 






(MB) 


(MB) 


(MB) 


Standard 


All 


48 


16 


13 




1&2 


43 


15 


13 




1 


17 


6.0 


4.8 


Compact 


All 


18 


3.3 


2.1 




1&2 


13 


2.9 


1.9 




1 


4.0 


1.9 


1.6 


Compact binary 


1 


1.8 


1.7 


1.7 



Table 18: Size of the benchmark file after applying different compression methods. 

• The three-momenta of decayed hadrons (status code 2) can be reconstructed from the sum of the 
momenta of the decay products. 

• The mass of a stable particle can be deduced from the particle ID. 

• The position of a vertex can be deduced from the previous vertex position and the life-time and 
momentum direction of the connecting particle. 

• Each particle in a HepMC event has a unique bar code, which is an otherwise arbitrary integer. No 
loss of information would result from renumbering the particles, simply inferring their bar code 
from the order in which they appear in the event. 

Precision Clearly, having 8 byte floating point numbers is not very relevant for many of the numbers in 
an event file. When comparing with experimental data, there is no point in having much larger precision 
than what is achievable in the experiment, and it makes sense to match the the information in the HepMC 
file to the precision of the actually measured variables in the experiments. 

A possible example is to store masses and transverse momenta as integers in units of 0. 1 MeV, azimuthal 
angles as integers in units of 0.00002 xvr, pseudorapidities as integers in units of 0.00001 and vertex 
positions as integers in units of 0.001 mm. 

22.3 Benchmark Results 

We have investigated several of the options listed in the previous section, and the resulting file sizes 



when applied to the benchmark file is presented in table 1 8 Firstly we see the size reduction using the 



standard format and simply reducing the number of particles, keeping only those with status code 2 and 1 
or only 1 . Next we present the same results, but using a compact format which keeps the structure of the 
HepMC ASCII file but applies all optimizations discussed above. Finally, for reference, we present an 
aggressively compacted Binary format which uses the following optimizations for each particle: stores 1 
float for transverse and 1 float for longitudinal momentum, a 3-byte integer for phi, and 1 byte for PDG 
IDs (rare PDG IDs are written out with 4 full bytes). This format loses the HepMC structure of the event 
and in some sense this represents the target size, below which it is difficult to go. 

It is clear that one does not gain much by using a binary format provided one uses the optimizations 
presented above together with bzip2 compression algorithm. 

22.4 Outlook 

Given the results above, the work to include a more efficient file format for the HepMC has begun. The 
suggestion is to keep the current structure of the file format, but to add options to exclude all particles 
except those with status code 1 (or 2). Furthermore options for the representation and precision of 
momenta and vertex positions will be included as well as options for excluding (simply replacing with a 
single exclamation mark for easy parsing) information which can be reconstructed. The new format will 
be included in a forthcoming HepMC version during 2012. 
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In this report we have not looked carefully at the time it takes to read the different formats. With the 
default HepMC format this can be many times larger than the time taken for typical particle-level analy- 
ses, while for minimal binary formats it is of the same order. We defer detailed study of this question to 
future work. 
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